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PREFACE. 


The usual point of view in the study of mechanics is that where 
the attention is mainly directed to the changes which take place in 
the course of time in a given system. The principal problem is the 
determination of the condition of the system with respect to con¬ 
figuration and velocities at any required time, when its condition 
in these respects has been given for some one time, and the funda¬ 
mental equations are those which express the changes continually 
taking place in the system. Inquiries of this kind are often sim¬ 
plified by taking into consideration conditions of the system other 
than those through which it actually passes or is supposed to pass, 
but our attention is not usually carried beyond conditions differing 
infinitesimally from those which are regarded as actual. 

For some purposes, however, it is desirable to take a broader 
view of the subject. We may imagine a great number of systems 
of the same nature, but differing in the configurations and veloc¬ 
ities which they have at a given instant, and differing not merely 
infinitesimally, but it may be so as to embrace every conceivable 
combination of configuration and velocities. And here we may set 
the problem, not to follow a particular system through its succes¬ 
sion of configurations, but to determine how the whole number of 
systems will be distributed among the various conceivable configu¬ 
rations and velocities at any required time, when the distribution 
has been given for some one time. The fundamental equation for 
this inquiry is that which gives the rate of change of the number of 
systems which fall within any infinitesimal limits of configuration 
and velocity. 

Such inquiries have been called by Maxwell statistical. They be¬ 
long to a branch of mechanics which owes its origin to the desire to 
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explain the laws of thermodynamics on mechanical principles, and 
of which Clausins, Maxwell, and Boltzmann are to be regarded as 
the principal founders. The first inquiries in this held were indeed 
somewhat narrower in their scope than that which has been men¬ 
tioned, being applied to the particles of a system, rather than to 
independent systems. Statistical inquiries were next directed to the 
phases (or conditions with respect to configuration and velocity) 
which succeed one another in a given system in the course of time. 
The explicit consideration of a great number of systems and their 
distribution in phase, and of the permanence or alteration of this 
distribution in the course of time is perhaps first found in Boltz¬ 
mann’s paper on the “Zusammenhang zwischen den Satzen iiber 
das Verhalten mehratomiger Gasmolekiilc mit Jacobi’s Princip des 
letzten Multiplicators” (1871). 

But although, as a matter of history, statistical mechanics owes 
its origin to investigations in thermodynamics, it seems eminently 
worthy of an independent development, both on account of the el¬ 
egance and simplicity of its principles, and because it yields new 
results and places old truths in a new light in departments quite 
outside of thermodynamics. Moreover, the separate study of this 
branch of mechanics seems to afford the best foundation for the 
study of rational thermodynamics and molecular mechanics. 

The laws of thermodynamics, as empirically determined, express 
the approximate and probable behavior of systems of a great number 
of particles, or, more precisely, they express the laws of mechanics 
for such systems as they appear to beings who have not the fine¬ 
ness of perception to enable them to appreciate quantities of the 
order of magnitude of those which relate to single particles, and 
who cannot repeat their experiments often enough to obtain any 
but the most probable results. The laws of statistical mechanics 
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apply to conservative systems of any number of degrees of freedom, 
and are exact. This does not make them more difficult to estab¬ 
lish than the approximate laws for systems of a great many degrees 
of freedom, or for limited classes of such systems. The reverse is 
rather the case, for our attention is not diverted from what is es¬ 
sential by the peculiarities of the system considered, and we are 
not obliged to satisfy ourselves that the effect of the quantities and 
circumstances neglected will be negligible in the result. The laws 
of thermodynamics may be easily obtained from the principles of 
statistical mechanics, of which they are the incomplete expression, 
but they make a somewhat blind guide in our search for those laws. 
This is perhaps the principal cause of the slow progress of ratio¬ 
nal thermodynamics, as contrasted with the rapid deduction of the 
consequences of its laws as empirically established. To this must 
be added that the rational foundation of thermodynamics lay in a 
branch of mechanics of which the fundamental notions and prin¬ 
ciples, and the characteristic operations, were alike unfamiliar to 
students of mechanics. 

We may therefore confidently believe that nothing will more con¬ 
duce to the clear apprehension of the relation of thermodynamics 
to rational mechanics, and to the interpretation of observed phe¬ 
nomena with reference to their evidence respecting the molecular 
constitution of bodies, than the study of the fundamental notions 
and principles of that department of mechanics to which thermody¬ 
namics is especially related. 

Moreover, we avoid the gravest difficulties when, giving up the 
attempt to frame hypotheses concerning the constitution of material 
bodies, we pursue statistical inquiries as a branch of rational me¬ 
chanics. In the present state of science, it seems hardly possible to 
frame a dynamic theory of molecular action which shall embrace the 
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phenomena of thermodynamics, of radiation, and of the electrical 
manifestations which accompany the union of atoms. Yet any the¬ 
ory is obviously inadequate which does not take account of all these 
phenomena. Even if we confine our attention to the phenomena dis¬ 
tinctively thermodynamic, we do not escape difficulties in as simple 
a matter as the number of degrees of freedom of a diatomic gas. ft 
is well known that while theory would assign to the gas six degrees 
of freedom per molecule, in our experiments on specific heat we 
cannot account for more than five. Certainly, one is building on an 
insecure foundation, who rests his work on hypotheses concerning 
the constitution of matter. 

Difficulties of this kind have deterred the author from attempt¬ 
ing to explain the mysteries of nature, and have forced him to be 
contented with the more modest aim of deducing some of the more 
obvious propositions relating to the statistical branch of mechan¬ 
ics. Here, there can be no mistake in regard to the agreement of 
the hypotheses with the facts of nature, for nothing is assumed in 
that respect. The only error into which one can fall, is the want 
of agreement between the premises and the conclusions, and this, 
with care, one may hope, in the main, to avoid. 

The matter of the present volume consists in large measure of 
results which have been obtained by the investigators mentioned 
above, although the point of view and the arrangement may be 
different. These results, given to the public one by one in the order 
of their discovery, have necessarily, in their original presentation, 
not been arranged in the most logical manner. 

In the first chapter we consider the general problem which has 
been mentioned, and find what may be called the fundamental equa¬ 
tion of statistical mechanics. A particular case of this equation will 
give the condition of statistical equilibrium, i.e., the condition which 
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the distribution of the systems in phase must satisfy in order that 
the distribution shall be permanent. In the general case, the fun¬ 
damental equation admits an integration, which gives a principle 
which may be variously expressed, according to the point of view 
from which it is regarded, as the conservation of density-in-phase, 
or of extension-in-phase, or of probability of phase. 

In the second chapter, we apply this principle of conservation of 
probability of phase to the theory of errors in the calculated phases 
of a system, when the determination of the arbitrary constants of 
the integral equations are subject to error. In this application, we 
do not go beyond the usual approximations. In other words, we 
combine the principle of conservation of probability of phase, which 
is exact, with those approximate relations, which it is customary to 
assume in the “theory of errors.” 

In the third chapter we apply the principle of conservation of 
extension-in-phase to the integration of the differential equations of 
motion. This gives Jacobi’s “last multiplier,” as has been shown by 
Boltzmann. 

In the fourth and following chapters we return to the consider¬ 
ation of statistical equilibrium, and confine our attention to con¬ 
servative systems. We consider especially ensembles of systems in 
which the index (or logarithm) of probability of phase is a linear 
function of the energy. This distribution, on account of its unique 
importance in the theory of statistical equilibrium, I have ventured 
to call canonical , and the divisor of the energy, the modulus of dis¬ 
tribution. The moduli of ensembles have properties analogous to 
temperature, in that equality of the moduli is a condition of equi¬ 
librium with respect to exchange of energy, when such exchange is 
made possible. 

We find a differential equation relating to average values in the 



PREFACE. 


vill 


ensemble which is identical in form with the fundamental differen¬ 
tial equation of thermodynamics, the average index of probability 
of phase, with change of sign, corresponding to entropy, and the 
modulus to temperature. 

For the average square of the anomalies of the energy, we find 
an expression which vanishes in comparison with the square of the 
average energy, when the number of degrees of freedom is indefi¬ 
nitely increased. An ensemble of systems in which the number of 
degrees of freedom is of the same order of magnitude as the number 
of molecules in the bodies with which we experiment, if distributed 
canonically, would therefore appear to human observation as an en¬ 
semble of systems in which all have the same energy. 

We meet with other quantities, in the development of the sub¬ 
ject, which, when the number of degrees of freedom is very great, 
coincide sensibly with the modulus, and with the average index of 
probability, taken negatively, in a canonical ensemble, and which, 
therefore, may also be regarded as corresponding to temperature 
and entropy. The correspondence is however imperfect, when the 
number of degrees of freedom is not very great, and there is nothing 
to recommend these quantities except that in definition they may be 
regarded as more simple than those which have been mentioned. In 
Chapter XIV, this subject of thermodynamic analogies is discussed 
somewhat at length. 

Finally, in Chapter XV, we consider the modification of the pre¬ 
ceding results which is necessary when we consider systems com¬ 
posed of a number of entirely similar particles, or, it may be, of a 
number of particles of several kinds, all of each kind being entirely 
similar to each other, and when one of the variations to be con¬ 
sidered is that of the numbers of the particles of the various kinds 
which are contained in a system. This supposition would naturally 
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have been introduced earlier, if our object had been simply the ex¬ 
pression of the laws of nature. It seemed desirable, however, to 
separate sharply the purely thermodynamic laws from those special 
modifications which belong rather to the theory of the properties of 
matter. 

J. W. G. 

New Haven, December, 1901. 



CONTENTS. 


CHAPTER I. 

GENERAL NOTIONS. THE PRINCIPLE OF CONSERVATION 
OF EXTENSION-IN-PHASE. 

Hamilton’s equations of motion. 1—3 

Ensemble of systems distributed in phase. 3 

Extension-in-phase, density-in-phase. 4 

Fundamental equation of statistical mechanics. 4—6 

Condition of statistical equilibrium. 6 

Principle of conservation of density-in-phase. 8 

Principle of conservation of extension-in-phase. 9 

Analogy in hydrodynamics. 10 

Extension-in-phase is an invariant. 10—13 

Dimensions of extension-in-phase. 13 

Various analytical expressions of the principle. 13—15 

Coefficient and index of probability of phase. 17 

Principle of conservation of probability of phase. 18, 19 

Dimensions of coefficient of probability of phase. 20 

CHAPTER II. 

APPLICATION OF THE PRINCIPLE OF CONSERVATION OF 
EXTENSION-IN-PHASE TO THE THEORY OF 
ERRORS. 


Approximate expression for the index of probability of phase.. .21, 22 
















CONTENTS. 


XI 


Application of the principle of conservation of probability of 

phase to the constants of this expression. 22—27 

CHAPTER III. 

APPLICATION OF THE PRINCIPLE OF CONSERVATION OF 
EXTENSION-IN-PHASE TO THE INTEGRATION OF 
THE DIFFERENTIAL EQUATIONS OF MOTION. 

Case in which the forces are functions of the coordinates alone . 28—32 
Case in which the forces are functions of the coordinates with 

the time. 33, 35 


CHAPTER IV. 

ON THE DISTRIBUTION-IN-PHASE CALLED CANONICAL, 
IN WHICH THE INDEX OF PROBABILITY IS A 
LINEAR FUNCTION OF THE ENERGY. 


Condition of statistical equilibrium. 35 

Other conditions which the coefficient of probability must satisfy.. .37 

Canonical distribution Modulus of distribution. 38 

"0 must be finite. 39 

The modulus of the canonical distribution has properties 

analogous to temperature. 39-41 

Other distributions have similar properties. 41 

Distribution in which the index of probability is a linear 
function of the energy and of the moments of momentum 
about three axes. 42, 43 










CONTENTS. Xll 

Case in which the forces are linear functions of the 

displacements, and the index is a linear function of the 
separate energies relating to the normal types of motion .... 43—46 
Differential equation relating to average values in a canonical 

ensemble. 47—49 

This is identical in form with the fundamental differential 

equation of thermodynamics. 49, 51 

CHAPTER V. 

AVERAGE VALUES IN A CANONICAL ENSEMBLE OF 
SYSTEMS. 

Case of v material points. Average value of kinetic energy of a 
single point for a given configuration or for the whole 

ensemble = |@. 51, 53 

Average value of total kinetic energy for any given configuration or 

for the whole ensemble = |z/@ . 53 

System of n degrees of freedom. Average value of kinetic energy, 

for any given configuration or for the whole ensemble = ^@.54—56 

Second proof of the same proposition. 56-58 

Distribution of canonical ensemble in configuration. 58—61 

Ensembles canonically distributed in configuration. 61 

Ensembles canonically distributed in velocity. 63 

CHAPTER VI. 

EXTENSION-IN-CONFIGURATION AND 
EXTENSION-IN-VELOCITY. 

Extension-in-configuration and extension-in-velocity are 

invariants. 64—67 











CONTENTS. 


Xlll 


Dimensions of these quantities. 68 

Index and coefficient of probability of configuration. 69 

Index and coefficient of probability of velocity. 71 

Dimensions of these coefficients. 72 

Relation between extension-in-configuration and 

extension-in-velocity. 73 

Definitions of extension-in-phase, extension-in-configuration, 
and extension-in-velocity, without explicit mention of 
coordinates. 74-77 

CHAPTER VII. 

FARTHER DISCUSSION OF AVERAGES IN A CANONICAL 
ENSEMBLE OF SYSTEMS. 

Second and third differential equations relating to average 

values in a canonical ensemble. 78, 80 

These are identical in form with thermodynamic equations 

enunciated by Clausius. 80 

Average square of the anomaly—of the energy of the kinetic 

energy—of the potential energy. 81—84 

These anomalies are insensible to human observation and 
experience when the number of degrees of freedom of the 
system is very great. 85, 86 

Average values of powers of the energies. 87—90 

Average values of powers of the anomalies of the energies. 90—93 

Average values relating to forces exerted on external bodies .... 93-97 
General formulae relating to averages in a canonical ensemble. 97—101 














CONTENTS. 


XIV 


CHAPTER VIII. 

ON CERTAIN IMPORTANT FUNCTIONS OF THE ENERGIES 
OF A SYSTEM. 

Definitions. V = extension-in-phase below a limiting 

energy (e). (j) = log dV/de . 101, 103 

V q = extension-in-configuration below a limiting value of the 

potential energy (e p ). cf) q = log dV q /de q . 104, 105 

V p = extension-in-velocity below a limiting value of the 

kinetic energy (e p ). cj)p = log dV p /de p . 105, 106 

Evaluation of V p and <f> p . 106-109 

Average values of functions of the kinetic energy. 110, 111 

Calculation of V from V q . Ill, 113 

Approximate formulae for large values of n. 114, 115 

Calculation of V or </> for whole system when given for parts. 115 

Geometrical illustration. 116 

CHAPTER IX. 

THE FUNCTION </> AND THE CANONICAL DISTRIBUTION. 

When n > 2, the most probable value of the energy in a 

canonical ensemble is determined by dcjj/de = 1/0. 117, 118 

When n > 2, the average value of dcfr/de in a canonical ensemble 

is 1/0. 118 

When n is large, the value of corresponding to d<f>/de = 10 
((j) o) is nearly equivalent (except for an additive constant) 
to the average index of probability taken negatively (—rj) . 118—122 













CONTENTS. 


XV 


Approximate formulae for (j>o + r] when n is large. 122—125 

When n is large, the distribution of a canonical ensemble in energy 
follows approximately the law of errors. 124 

This is not peculiar to the canonical distribution. 126, 128 

Averages in a canonical ensemble. 128—135 


CHAPTER X. 

ON A DISTRIBUTION IN PHASE CALLED 

MICROCANONICAL IN WHICH ALL THE SYSTEMS 
HAVE THE SAME ENERGY. 

The microcanonical distribution defined as the limiting 

distribution obtained by various processes. 135, 136 

Average values in the microcanonical ensemble of functions of 

the kinetic and potential energies. 138—141 

If two quantities have the same average values in every 

microcanonical ensemble, they have the same average value in 

every canonical ensemble. 141 

Average values in the microcanonical ensemble of functions of 

the energies of parts of the system. 142—145 

Average values of functions of the kinetic energy of a part of 

the system. 145, 146 

Average values of the external forces in a microcanonical 
ensemble. Differential equation relating to these averages, 
having the form of the fundamental differential equation 
of thermodynamics. 146—151 












CONTENTS. 


XVI 


CHAPTER XI. 

MAXIMUM AND MINIMUM PROPERTIES OF VARIOUS 
DISTRIBUTIONS IN PHASE. 

Theorems I-VI. Minimum properties of certain distributions. 152—157 
Theorem VII. The average index of the whole system 

compared with the sum of the average indices of the parts 157—159 
Theorem VIII. The average index of the whole ensemble 

compared with the average indices of parts of the ensemblel59— 162 
Theorem IX. Effect on the average index of making the 

distribution-in-phase uniform within any limits. 162—163 

CHAPTER XII. 

ON THE MOTION OF SYSTEMS AND ENSEMBLES OF 
SYSTEMS THROUGH LONG PERIODS OF TIME. 

Under what conditions, and with what limitations, may we 
assume that a system will return in the course of time to 
its original phase, at least to any required degree of 

approximation?. 163—167 

Tendency in an ensemble of isolated systems toward a state of 

statistical equilibrium. 168-177 


CHAPTER XIII. 

EFFECT OF VARIOUS PROCESSES ON AN ENSEMBLE OF 
SYSTEMS. 


Variation of the external coordinates can only cause a 
decrease in the average index of probability. 


177-180 






CONTENTS. 


XVII 


This decrease may in general be diminished by diminishing 


the rapidity of the change in the external coordinates. 180—183 

The mutual action of two ensembles can only diminish the 

sum of their average indices of probability. 184, 186 


In the mutual action of two ensembles which are canonically 

distributed, that which has the greater modulus will lose energyl87 
Repeated action between any ensemble and others which are 
canonically distributed with the same modulus will tend to 
distribute the first-mentioned ensemble canonically with the 


same modulus. 188 

Process analogous to a Carnot’s cycle. 189, 191 

Analogous processes in thermodynamics. 191, 192 


CHAPTER XIV. 

DISCUSSION OF THERMODYNAMIC ANALOGIES. 

The finding in rational mechanics an a priori foundation for 
thermodynamics requires mechanical definitions of 
temperature and entropy. Conditions which the quantities 
thus defined must satisfy. 193-196 

The modulus of a canonical ensemble (0), and the average 
index of probability taken negatively (r]), as analogues of 
temperature and entropy. 196-198 

The functions of the energy de/d log V and log V as analogues 

of temperature and entropy. 198—202 

The functions of the energy de/dcj) and 4> as analogues of 

temperature and entropy. 202—209 

Merits of the different systems. 209—215 












CONTENTS. 


XV111 


If a system of a great number of degrees of freedom is 

microcanonically distributed in phase, any very small part of it 


may be regarded as canonically distributed. 215 

Units of 0 and Tj compared with those of temperature and 

entropy. 215-219 


CHAPTER XV. 

SYSTEMS COMPOSED OF MOLECULES. 


Generic and specific definitions of a phase. 219—222 

Statistical equilibrium with respect to phases generically defined 

and with respect to phases specifically defined. 222 

Grand ensembles, petit ensembles. 222, 223 

Grand ensemble canonically distributed. 223—226 

H must be finite. 226 

Equilibrium with respect to gain or loss of molecules. 227—231 

Average value of any quantity in grand ensemble canonically 

distributed. 232 


Differential equation identical in form with fundamental 

differential equation in thermodynamics. 233, 235 

Average value of number of any kind of molecules {y) . 236 

Average value of {v — V ) 2 . 236, 237 

Comparison of indices. 238—242 

When the number of particles in a system is to be treated as 

variable, the average index of probability for phases generically 
defined corresponds to entropy. 242 
















ELEMENTARY PRINCIPLES IN 
STATISTICAL MECHANICS 


I. 

GENERAL NOTIONS. THE PRINCIPLE OF CONSERVATION 
OF EXTENSION-IN-PHASE. 

We shall use Hamilton’s form of the equations of motion for a sys¬ 
tem of n degrees of freedom, writing qi, ... q n for the (generalized) 
coordinates, q \, ... q n for the (generalized) velocities, and 


Fi dqi + F 2 dq 2 + • • • + F n dq n (1) 


for the moment of the forces. We shall call the quantities F 1 , ... F n 
the (generalized) forces, and the quantities pi, .. .p n , defined by the 
equations 


da^p ddp 

Pi = w, P 2 = tv, etc., 
dqi dq 2 


( 2 ) 


where e p denotes the kinetic energy of the system, the (generalized) 
momenta. The kinetic energy is here regarded as a function of the 
velocities and coordinates. We shall usually regard it as a function 
of the momenta and coordinates,* and on this account we denote 
it by e p . This will not prevent us from occasionally using formulae 
like (2), where it is sufficiently evident the kinetic energy is regarded 


*The use of the momenta instead of the velocities as independent variables 
is the characteristic of Hamilton’s method which gives his equations of motion 
their remarkable degree of simplicity. We shall find that the fundamental no¬ 
tions of statistical mechanics are most easily defined, and are expressed in the 
most simple form, when the momenta with the coordinates are used to describe 
the state of a system. 
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as function of the q 's and q's. But in expressions like de p /dqi, where 
the denominator does not determine the question, the kinetic energy 
is always to be treated in the differentiation as function of the p’s 
and q's. 

We have then 

Qi = Pi = ~t (p + Fi, etc. (3) 

dp i dq i 

These equations will hold for any forces whatever. If the forces 
are conservative, in other words, if the expression (1) is an exact 
differential, we may set 


rp _ de q 
1 dqi 


F 2 = 

dq 2 


etc., 


( 4 ) 


where e q is a function of the coordinates which we shall call the 
potential energy of the system. If we write e for the total energy, 
we shall have 

e = e p + e q , (5) 

and equations (3) may be written 




etc. 


( 6 ) 


The potential energy (e q ) may depend on other variables beside 
the coordinates q \, ... q n . We shall often suppose it to depend in 
part on coordinates of external bodies, which we shall denote by ai, 
02 , etc. We shall then have for the complete value of the differential 
of the potential energy* 


de q = —Fi dq\ — ■■■ — F n dq n — A 1 da x — A 2 da 2 — etc., (7) 


*It will be observed, that although we call e q the potential energy of the 
system which we are considering, it is really so defined as to include that energy 
which might be described as mutual to that system and external bodies. 
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where A 2 , etc., represent forces (in the generalized sense) ex¬ 
erted by the system on external bodies. For the total energy (e) we 
shall have 

de = qi dpi H-b q n dp n - pi d<?i- 

— p n dq n — A\ da i — A 2 da 2 — etc., (8) 

It will be observed that the kinetic energy ( e p ) in the most gen¬ 
eral case is a quadratic function of the p’s (or g’s) involving also 
the g’s but not the a’s; that the potential energy, when it exists, is 
function of the g’s and a’s; and that the total energy, when it exists, 
is function of the p’s (or <fs), the g’s, and the a’s. In expressions 
like de/dqi, the p’s, and not the g’s, are to be taken as independent 
variables, as has already been stated with respect to the kinetic 
energy. 

Let us imagine a great number of independent systems, identi¬ 
cal in nature, but differing in phase, that is, in their condition with 
respect to configuration and velocity. The forces are supposed to 
be determined for every system by the same law, being functions 
of the coordinates of the system qi, ... q n , either alone or with the 
coordinates ai, a 2 , etc. of certain external bodies. It is not nec¬ 
essary that they should be derivable from a force-function. The 
external coordinates a±, a 2 , etc. may vary with the time, but at any 
given time have fixed values. In this they differ from the internal 
coordinates <p, ... q n , which at the same time have different values 
in the different systems considered. 

Let us especially consider the number of systems which at a given 
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instant fall within specified limits of phase, viz., those for which 

Pi < Pi < Pi, 

Ih <P2 < P'l, 

Pn < Pn < Pn, 

the accented letters denoting constants. We shall suppose the differ¬ 
ences p'[ — p \, q'I — q[, etc. to be infinitesimal, and that the systems 
are distributed in phase in some continuous manner,* so that the 
number having phases within the limits specified may be represented 
by 

m - p[) • • ■ (Pn - p'Mi - q[) ■■■(€- q'n), ( 10 ) 

or more briefly by 

D dpi ■ ■ ■ dp n dqi ■ ■ ■ dq n , ( 11 ) 

where D is a function of the p’s and g’s and in general of t also, for 
as time goes on, and the individual systems change their phases, the 
distribution of the ensemble in phase will in general vary. In special 
cases, the distribution in phase will remain unchanged. These are 
cases of statistical equilibrium. 

If we regard all possible phases as forming a sort of extension of 
2 n dimensions, we may regard the product of differentials in (11) 

*In strictness, a finite number of systems cannot be distributed continu¬ 
ously in phase. But by increasing indefinitely the number of systems, we may 
approximate to a continuous law of distribution, such as is here described. To 
avoid tedious circumlocution, language like the above may be allowed, although 
wanting in precision of expression, when the sense in which it is to be taken 
appears sufficiently clear. 


q[ < qi < q'l, 
q 2 <q 2 < q'l, 


q n <q n < q'l,) 


(9) 
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as expressing an element of this extension, and D as expressing the 
density of the systems in that element. We shall call the product 

dpi - ■ ■ dp n dqi ■ ■ ■ dq n (12) 

an element of extension-in-phase, and D the density-in-phase of the 
systems. 

It is evident that the changes which take place in the density of 
the systems in any given element of extension-in-phase will depend 
on the dynamical nature of the systems and their distribution in 
phase at the time considered. 

In the case of conservative systems, with which we shall be 
principally concerned, their dynamical nature is completely deter¬ 
mined by the function which expresses the energy (e) in terms of the 
p’s, q’s, and a’s (a function supposed identical for all the systems); 
in the more general case which we are considering, the dynamical 
nature of the systems is determined by the functions which express 
the kinetic energy (e p ) in terms of the p’s and q’s, and the forces 
in terms of the q’s and a’s. The distribution in phase is expressed 
for the time considered by D as function of the p’s and q’s. To find 
the value of dD/dt for the specified element of extension-in-phase, 
we observe that the number of systems within the limits can only 
be varied by systems passing the limits, which may take place in 
4n different ways, viz., by the pi of a system passing the limit p \, 
or the limit p", or by the of a system passing the limit q[ or the 
limit q'l , etc. Let us consider these cases separately. 

In the first place, let us consider the number of systems which 
in the time dt pass into or out of the specified element by pi passing 
the limit p\. It will be convenient, and it is evidently allowable, to 
suppose dt so small that the quantities pi dt, cp dt, etc., which rep¬ 
resent the increments of p \, q\, etc., in the time dt shall be infinitely 
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small in comparison with the infinitesimal differences p'[ —p\ , q'(—q [, 
etc., which determine the magnitude of the element of extension-in- 
phase. The systems for which pi passes the limit p\ in the interval dt 
are those for which at the commencement of this interval the value 
of pi lies between p\ and p\ — pi dt, as is evident if we consider 
separately the cases in which p x is positive and negative. Those 
systems for which pi lies between these limits, and the other p’s 
and g’s between the limits specified in (9), will therefore pass into 
or out of the element considered according as p\ is positive or neg¬ 
ative, unless indeed they also pass some other limit specified in (9) 
during the same interval of time. But the number which pass any 
two of these limits will be represented by an expression containing 
the square of dt as a factor, and is evidently negligible, when dt is 
sufficiently small, compared with the number which we are seeking 
to evaluate, and which (with neglect of terms containing d,t' 2 ) may 
be found by substituting pi dt for p'[ — p\ in (10) or for dpi in (11). 

The expression 

D pi dt dp 2 ■ ■ ■ dp n dqi ■ ■ ■ dq n (13) 


will therefore represent, according as it is positive or negative, the 
increase or decrease of the number of systems within the given limits 
which is due to systems passing the limit p\. A similar expression, 
in which however D and p will have slightly different values (being 
determined for p'[ instead of p \), will represent the decrease or in¬ 
crease of the number of systems due to the passing of the limit p". 
The difference of the two expressions, or 


d(Dpi) 

dpi 


dpi ■ ■ ■ dp n dqi ■ ■ ■ dq n dt 


(14) 


will represent algebraically the decrease of the number of systems 
within the limits due to systems passing the limits p[ and p". 
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The decrease in the number of systems within the limits due to 
systems passing the limits q\ and q'{ may be found in the same way. 
This will give 


( ^dp ^ + ^dq dPl ' " dpn dqi "' dqn dt ( 15 ) 

for the decrease due to passing the four limits p[, p", q[, q”. But 
since the equations of motion (3) give 


dpi dqi _ Q 

dpi dqi 


the expression reduces to 
fdD . dl 

Pl dqi ^J 


fdD . dD . \ , 

— Pi H—;— Qi ) dpi - ■ ■ dp n dqi - ■ ■ dq n dt. 


(16) 


(17) 


If we prefix to denote summation relative to the suffixes 
1, ... n, we get the total decrease in the number of systems within 
the limits in the time dt. That is, 



dpi ■ ■ ■ dp n dqi ■ ■ ■ dq n dt 

= —dD dpi ■ ■ ■ dp n dqi ■ ■ ■ dq n: 


(18) 


or 




dD dD 

~T~ Pi + x— q i 

dpi dqi 



(19) 


where the suffix applied to the differential coefficient indicates that 
the p’s and q's are to be regarded as constant in the differentiation. 
The condition of statistical equilibrium is therefore 



0 . 


( 20 ) 
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If at any instant this condition is fulfilled for all values of the p’s 
and g’s, ( dD/dt) p ^ q vanishes, and therefore the condition will con¬ 
tinue to hold, and the distribution in phase will be permanent, so 
long as the external coordinates remain constant. But the statis¬ 
tical equilibrium would in general be disturbed by a change in the 
values of the external coordinates, which would alter the values of 
the p’s as determined by equations (3), and thus disturb the relation 
expressed in the last equation. 

If we write equation (19) in the form 


fdD\ , 

( “) dt 
V dt ) p q 


E 


dD dD > 

— pidt + —qidt) =0, (21) 


it will be seen to express a theorem of remarkable simplicity. Since 
D is a function of t, pi, .. .p n , qi, ... q n , its complete differential 
will consist of parts due to the variations of all these quantities. 
Now the first term of the equation represents the increment of D 
due to an increment of t (with constant values of the p’s and q' s), 
and the rest of the first member represents the increments of D 
due to increments of the p’s and g’s, expressed by pi dt, q\ dt, etc. 
But these are precisely the increments which the p’s and q 's receive 
in the movement of a system in the time dt. The whole expression 
represents the total increment of D for the varying phase of a moving 
system. We have therefore the theorem:— 

In an ensemble of mechanical systems identical in nature and 
subject to forces determined by identical laws, but distributed in 
phase in any continuous manner, the density-in-phase is constant 
in time for the varying phases of a moving system; provided, that 
the forces of a system are functions of its coordinates, either alone 
or with the time.* 


The condition that the forces F\, ... F n are functions of qi , ... q n and 
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This may be called the principle of conservation of density-in- 
phase. It may also be written 



( 22 ) 


where a, .. .h represent the arbitrary constants of the integral equa¬ 
tions of motion, and are suffixed to the differential coefficient to 
indicate that they are to be regarded as constant in the differenti¬ 
ation. 

We may give to this principle a slightly different expression. Let 
us call the value of the integral 


dpi ■ ■ ■ dp n dqi ■ ■ ■ dq. n 


taken within any limits the extension-in-phase within those limits. 

When the phases bounding an extension-in-phase vary in the 
course of time according to the dynamical laws of a system subject 
to forces which are functions of the coordinates either alone or with 
the time, the value of the extension-in-phase thus bounded remains 
constant. In this form the principle may be called the principle 
of conservation of extension-in-phase. In some respects this may 
be regarded as the most simple statement of the principle, since it 
contains no explicit reference to an ensemble of systems. 

Since any extension-in-phase may be divided into infinitesimal 
portions, it is only necessary to prove the principle for an infinitely 

ai, 02, etc., which last are functions of the time, is analytically equivalent to 
the condition that F\, ... F n are functions of qi, ... q„ and the time. Explicit 
mention of the external coordinates, ai, 02, etc., has been made in the pre¬ 
ceding pages, because our purpose will require us hereafter to consider these 
coordinates and the connected forces, A\, A2, etc., which represent the action 
of the systems on external bodies. 
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small extension. The number of systems of an ensemble which fall 
within the extension will be represented by the integral 


D dpi ■ ■ ■ dp n dqi ■ ■ ■ dq n 


If the extension is infinitely small, we may regard D as constant in 
the extension and write 


D 


■ ■ ■ dp n dqi ■ ■ ■ dq n 


for the number of systems. The value of this expression must be 
constant in time, since no systems are supposed to be created or 
destroyed, and none can pass the limits, because the motion of the 
limits is identical with that of the systems. But we have seen that 
D is constant in time, and therefore the integral 


dpi - ■■ dp n dqi ■ ■ ■ dq n , 


which we have called the extension-in-phase, is also constant in 
time.* 

Since the system of coordinates employed in the foregoing dis¬ 
cussion is entirely arbitrary, the values of the coordinates relating 

*If we regard a phase as represented by a point in space of 2 n dimensions, the 
changes which take place in the course of time in our ensemble of systems will 
be represented by a current in such space. This current will be steady so long as 
the external coordinates are not varied. In any case the current will satisfy a law 
which in its various expressions is analogous to the hydrodynamic law which 
may be expressed by the phrases conservation of volumes or conservation of 
density about a moving point, or by the equation 
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to any configuration and its immediate vicinity do not impose any 
restriction upon the values relating to other configurations. The 
fact that the quantity which we have called density-in-phase is con¬ 
stant in time for any given system, implies therefore that its value is 
independent of the coordinates which are used in its evaluation. For 
let the density-in-phase as evaluated for the same time and phase 
by one system of coordinates be D \, and by another system D 2 . 
A system which at that time has that phase will at another time 
have another phase. Let the density as calculated for this second 
time and phase by a third system of coordinates be D 3 . Now we 
may imagine a system of coordinates which at and near the first 
configuration will coincide with the first system of coordinates, and 
at and near the second configuration will coincide with the third 
system of coordinates. This will give D\ = D'L Again we may 
imagine a system of coordinates which at and near the first con¬ 
figuration will coincide with the second system of coordinates, and 
at and near the second configuration will coincide with the third 
system of coordinates. This will give D' 2 = D". We have therefore 
D[ = D f 2 . 

It follows, or it may be proved in the same way, that the value of 
an extension-in-phase is independent of the system of coordinates 
which is used in its evaluation. This may easily be verified directly. 

The analogue in statistical mechanics of this equation, viz., 

dp 1 , dq ± dp 2 dq 2 

h -^ h -^ h-^ j -1 ete - — 0, 

dpi dqi dp 2 dq 2 

may be derived directly from equations ( 3 ) or ( 6 ), and may suggest such the¬ 
orems as have been enunciated, if indeed it is not regarded as making them 
intuitively evident. The somewhat lengthy demonstrations given above will at 
least serve to give precision to the notions involved, and familiarity with their 


use. 
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If qi, .. . q n , Qi, ... Q n are two systems of coordinates, and Pi, ■ ■ ■ p n , 
Pi, ... P n the corresponding momenta, we have to prove that 


dpi ■ ■ ■ dp n clqi ■ ■ ■ dq n 


dP\ ■ ■ ■ dP n d() • • • d() ,i. 


(24) 

when the multiple integrals are taken within limits consisting of the 
same phases. And this will be evident from the principle on which 
we change the variables in a multiple integral, if we prove that 


d(P\, . . . P n , Qh ■ ■ ■ Qn) 
d(pi,...p n ,qi,...q n ) 


where the first member of the equation represents a Jacobian or 
functional determinant. Since all its elements of the form dQ/dp 
are equal to zero, the determinant reduces to a product of two, and 
we have to prove that 


d(Pi, ■ ■ ■ P n ) d(Q i,... Q n ) 

d(jP\, ■ ■ • Pn) d(q 1 ,...q n ) 


We may transform any element of the first of these determinants 
as follows. By equations (2) and (3), and in view of the fact that 
the Q' s are linear functions of the q’ s and therefore of the p’s, with 
coefficients involving the q' s, so that a differential coefficient of the 
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form dQ r /dpy is function of the q' s alone, we get* 


dP x d de p 


r=n 

E 


d 2 e p dQ r 


dp y dp y dQ x f^\dQ r dQ x dp y 


d 


r=n 

E 


de p dQ r 


d de r . 


dq y 


dQx fo \ dQr dpy I dQx dpy dQx 


(27) 


But since 


% 


! E 

r =1 

dq y 


dq y 
dQ r 
_ dqy 

dQ x dQi 


Q, 


(28) 


Therefore, 


d(Pi,...P n ) d(qi,...q n ) d(qi,...q n ) 


d(pi, ■ • ■ Pn) d(Q 1 , . . . Qn) d(Q 1, . . . Qn) 

The equation to be proved is thus reduced to 
d(Pi, ... P n ) d(Qi,... Q n ) 


(29) 


d(pi , • • • Pn) d(q 1 ,...q n ) 


= 1 , 


(30) 


"The form of the equation 


d der 


d de n 


dp y dQ x dQ x dp y 

in ( 27 ) reminds us of the fundamental identity in the differential calculus relat¬ 
ing to the order of differentiation with respect to independent variables. But 
it will be observed that here the variables Q x and p y are not independent and 
that the proof depends on the linear relation between the Q’s and the p’s. 
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which is easily proved by the ordinary rule for the multiplication of 
determinants. 

The numerical value of an extension-in-phase will however de¬ 
pend on the units in which we measure energy and time. For a 
product of the form dpdq has the dimensions of energy multiplied 
by time, as appears from equation (2), by which the momenta are 
defined. Hence an extension-in-phase has the dimensions of the 
nth power of the product of energy and time. In other words, it has 
the dimensions of the nth power of action , as the term is used in 
the ‘principle of Least Action .’ 

If we distinguish by accents the values of the momenta and 
coordinates which belong to a time t', the unaccented letters relating 
to the time t, the principle of the conservation of extension-in-phase 
may be written 


dpi ■ ■ ■ dp n dq\ ■ ■ ■ dq n 


■ ■ ■ dp n dq\ • • • dq n , (31) 


or more briefly 


dp 1 • • • dg„ 


■■■dC (32) 


the limiting phases being those which belong to the same systems 
at the times t and t' respectively. But we have identically 


dp 1 ■ ■ ■ dq n 


d(pi,...q n ) 

d(p\ , • • • q’n) 


dp'i • • • dq' n 


for such limits. The principle of conservation of extension-in-phase 
may therefore be expressed in the form 


d{pi,...q n ) 
d(p[, ■■■q ' n ) 


1 . 


(33) 
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This equation is easily proved directly. For we have identically 

d(pi,...q n ) = rf(pi,---gn) ... q") 

d(p'i, ■ ■ ■ q' n ) d{p'{, ...q") d(p[,... q' n ) ’ 

where the double accents distinguish the values of the momenta 
and coordinates for a time t". If we vary t, while t' and t" remain 
constant, we have 

d_ d(p l ,...q n ) _ d(j/j ,... g") d_ d(pi,...q n ) 
dt d (p \, ...q' n ) d(p \, ...q' n ) dt d{jpf{,... q ”)' 

Now since the time t" is entirely arbitrary, nothing prevents us 
from making t" identical with t at the moment considered. Then 
the determinant 

d(pi,...q n ) 
dip ", • • • q'n) 

will have unity for each of the elements on the principal diagonal, 
and zero for all the other elements. Since every term of the determi¬ 
nant except the product of the elements on the principal diagonal 
will have two zero factors, the differential of the determinant will 
reduce to that of the product of these elements, i.e., to the sum of 
the differentials of these elements. This gives the equation 

d djpi, ...q n ) _ dpi dpn dqi dq n 

dt d(p", ■■ .q'n) dp" dpl dq'{ dq 

Now since t = t" , the double accents in the second member of this 
equation may evidently be neglected. This will give, in virtue of 
such relations as (16), 

d_ d(pi,...g n ) = () 
dt d(p ",... g") 
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which substituted in (34) will give 


d_ d(pi,...q n ) = 
dt d(p \, ...q' n ) 


The determinant in this equation is therefore a constant, the value 
of which may be determined at the instant when t = t !, when it is 
evidently unity. Equation (33) is therefore demonstrated. 

Again, if we write a, ... h for a system of 2n arbitrary constants 
of the integral equations of motion, p \, q 1 , etc. will be functions of 
a, ... h, and t, and we may express an extension-in-phase in the 
form 


d(pi, ...q n ) 
d(a,. . . , h) 


da - ■ ■ dh. 


(35) 


If we suppose the limits specified by values of a, ... h, a system 
initially at the limits will remain at the limits. The principle of 
conservation of extension-in-phase requires that an extension thus 
bounded shall have a constant value. This requires that the de¬ 
terminant under the integral sign shall be constant, which may be 
written 


d_ d(pi,... q n ) 
dt d(a ,..., h) 


(36) 


This equation, which may be regarded as expressing the principle 
of conservation of extension-in-phase, may be derived directly from 
the identity 


d(pi,...q n ) = d{p l ,...q n ) d(p[,. .. g' n ) 
d(a,...,h) d(p[,... q’ n ) d(a,...,h) 


in connection with equation (33). 

Since the coordinates and momenta are functions of a, ... h, 
and t, the determinant in (36) must be a function of the same 
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variables, and since it does not vary with the time, it must be a 
function of a, ... h alone. We have therefore 


d(pi,... q n ) 
d(a ,..., h) 


func.(a,... h). 


(37) 


It is the relative numbers of systems which fall within different 
limits, rather than the absolute numbers, with which we are most 
concerned. It is indeed only with regard to relative numbers that 
such discussions as the preceding will apply with literal precision, 
since the nature of our reasoning implies that the number of systems 
in the smallest element of space which we consider is very great. 
This is evidently inconsistent with a finite value of the total number 
of systems, or of the density-in-phase. Now if the value of D is 
infinite, we cannot speak of any definite number of systems within 
any finite limits, since all such numbers are infinite. But the ratios 
of these infinite numbers may be perfectly definite. If we write N 
for the total number of systems, and set 


P = 


D 

N f 


(38) 


P may remain finite, when N and D become infinite. The integral 


P dpi... dq n 


(39) 


taken within any given limits, will evidently express the ratio of the 
number of systems falling within those limits to the whole number 
of systems. This is the same thing as the probability that an un¬ 
specified system of the ensemble (he. one of which we only know 
that it belongs to the ensemble) will lie within the given limits. The 
product 


P dpi... dq n 


( 40 ) 
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expresses the probability that an unspecihed system of the ensemble 
will be found in the element of extension-in-phase dpi... dq n . We 
shall call P the coefficient of probability of the phase considered. Its 
natural logarithm we shall call the index of probability of the phase, 
and denote it by the letter 77 . 

If we substitute NP and Ne ri for D in equation (19), we get 



(41) 


and 



(42) 


The condition of statistical equilibrium may be expressed by equat¬ 
ing to zero the second member of either of these equations. 

The same substitutions in (22) give 



(43) 


and 



(44) 


That is, the values of P and 77 , like those of D, are constant in 
time for moving systems of the ensemble. From this point of view, 


the principle which otherwise regarded has been called the principle 


of conservation of density-in-phase or conservation of extension-in- 
phase, may be called the principle of conservation of the coefficient 
(or index) of probability of a phase varying according to dynamical 
laws, or more briefly, the principle of conservation of probability 


of phase. It is subject to the limitation that the forces must be 
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functions of the coordinates of the system either alone or with the 
time. 

The application of this principle is not limited to cases in which 
there is a formal and explicit reference to an ensemble of systems. 
Yet the conception of such an ensemble may serve to give precision 
to notions of probability. It is in fact customary in the discussion 
of probabilities to describe anything which is imperfectly known as 
something taken at random from a great number of things which 
are completely described. But if we prefer to avoid any reference to 
an ensemble of systems, we may observe that the probability that 
the phase of a system falls within certain limits at a certain time, is 
equal to the probability that at some other time the phase will fall 
within the limits formed by phases corresponding to the first. For 
either occurrence necessitates the other. That is, if we write P' for 
the coefficient of probability of the phase p \, ... q' n at the time t', 
and P" for that of the phase p", ... q” at the time t", 

[■■■[ P' dp'i ■ ■ ■ dq' n — [■■■[ P"dp';--dql (45) 

where the limits in the two cases are formed by corresponding 
phases. When the integrations cover infinitely small variations of 
the momenta and coordinates, we may regard P' and P" as constant 
in the integrations and write 

P j l dp[---dP = P" j--jAfi — K- 

Now the principle of the conservation of extension-in-phase, which 
has been proved (viz., in the second demonstration given above) 
independently of any reference to an ensemble of systems, requires 
that the values of the multiple integrals in this equation shall be 
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equal. This gives 

P" = P'. 

With reference to an important class of cases this principle may 
be enunciated as follows. 

When the differential equations of motion are exactly known, 
but the constants of the integral equations imperfectly determined, 
the coefficient of probability of any phase at any time is equal to 
the coefficient of probability of the corresponding phase at any other 
time. By corresponding phases are meant those which are calculated 
for different times from the same values of the arbitrary constants 
of the integral equations. 

Since the sum of the probabilities of all possible cases is neces¬ 
sarily unity, it is evident that we must have 

[ - a - U [ P dpi ■ ■ ■ dq n = 1, (46) 

J phases J 

where the integration extends over all phases. This is indeed only 
a different form of the equation 


N 


j all 

phases 


D dpi ■ ■ ■ dq n , 


which we may regard as defining N. 

The values of the coefficient and index of probability of phase, 
like that of the density-in-phase, are independent of the system of 
coordinates which is employed to express the distribution in phase 
of a given ensemble. 

In dimensions, the coefficient of probability is the reciprocal of 
an extension-in-phase, that is, the reciprocal of the nth power of the 
product of time and energy. The index of probability is therefore 
affected by an additive constant when we change our units of time 
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and energy. If the unit of time is multiplied by c t and the unit 
of energy is multiplied by c e , all indices of probability relating to 
systems of n degrees of freedom will be increased by the addition of 

n log c t + n log c e . (47) 



II. 


APPLICATION OF THE PRINCIPLE OF CONSERVATION OF 
EXTENSION-IN-PHASE TO THE THEORY OF ERRORS. 

Let us now proceed to combine the principle which has been 
demonstrated in the preceding chapter and which in its different 
applications and regarded from different points of view has been 
variously designated as the conservation of density-in-phase, or of 
extension-in-phase, or of probability of phase, with those approxi¬ 
mate relations which are generally used in the ‘theory of errors.’ 

We suppose that the differential equations of the motion of a 
system are exactly known, but that the constants of the integral 
equations are only approximately determined. It is evident that the 
probability that the momenta and coordinates at the time t' fall 
between the limits p\ and p\ + dp[, q[ and q[ + dq [, etc., may be 
expressed by the formula 

e 11 ' dp^ ... dq' n , (48) 

where rf (the index of probability for the phase in question) is a 
function of the coordinates and momenta and of the time. 

Let Q \, P[, etc. be the values of the coordinates and momenta 
which give the maximum value to rf, and let the general value of rf 
be developed by Taylor’s theorem according to ascending powers 
and products of the differences p\ — P[, q[ — Q[, etc., and let us sup¬ 
pose that we have a sufficient approximation without going beyond 
terms of the second degree in these differences. We may therefore 
set 

rf — c — F', (49) 

where c is independent of the differences p\ — P[, q[ — Q \, etc., and 
F’ is a homogeneous quadratic function of these differences. The 
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terms of the first degree vanish in virtue of the maximum condition, 
which also requires that F' must have a positive value except when 
all the differences mentioned vanish. If we set 

C = e c , (50) 

we may write for the probability that the phase lies within the limits 
considered 

Ce~ F dp\ ... dq' n . (51) 

C is evidently the maximum value of the coefficient of probability 
at the time considered. 

In regard to the degree of approximation represented by these 
formulae, it is to be observed that we suppose, as is usual in the 
‘theory of errors,’ that the determination (explicit or implicit) of the 
constants of motion is of such precision that the coefficient of prob¬ 
ability e 71 or Ce~ h is practically zero except for very small values 
of the differences p\ — P [, q\ — Q[, etc. For very small values of these 
differences the approximation is evidently in general sufficient, for 
larger values of these differences the value of Ce~ F will be sensibly 
zero, as it should be, and in this sense the formula will represent 
the facts. 

We shall suppose that the forces to which the system is subject 
are functions of the coordinates either alone or with the time. The 
principle of conservation of probability of phase will therefore apply, 
which requires that at any other time ( t") the maximum value of 
the coefficient of probability shall be the same as at the time f', and 
that the phase (P", Q”, etc.) which has this greatest probability- 
coefficient, shall be that which corresponds to the phase (P(, Q[, 
etc.), i.e., which is calculated from the same values of the constants 
of the integral equations of motion. 
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We may therefore write for the probability that the phase at the 
time t" falls within the limits p" and p” + dp”, q” and q” + dq", etc., 

Ce~ F dp” ... dq”, (52) 

where C represents the same value as in the preceding formula, viz., 
the constant value of the maximum coefficient of probability, and 
F" is a quadratic function of the differences p” — P", q\ — Q", etc., 
the phase (P", Q", etc.) being that which at the time t" corresponds 
to the phase {P[, Q \, etc.) at the time t'. 

Now we have necessarily 

J- ■ j Ce~ F dp\ ... dq n = J ■ ■ ■ J Ce~ b dp” ... dq” = 1, (53) 


when the integration is extended over all possible phases. It will 
be allowable to set ±oo for the limits of all the coordinates and 
momenta, not because these values represent the actual limits of 
possible phases, but because the portions of the integrals lying out¬ 
side of the limits of all possible phases will have sensibly the value 
zero. With ±oo for limits, the equation gives 


Cn n _ Cn n 

7T~7r 


(54) 


where f is the discriminant* of F', and f” that of F". This discrim¬ 
inant is therefore constant in time, and like C an absolute invariant 
in respect to the system of coordinates which may be employed. In 
dimensions, like C 2 , it is the reciprocal of the 2 nth power of the 
product of energy and time. 


*This term is used to denote the determinant having for elements on the 
principal diagonal the coefficients of the squares in the quadratic function F', 
and for its other elements the halves of the coefficients of the products in F'. 
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Let us see precisely how the functions F" and F' are related. The 
principle of the conservation of the probability-coefficient requires 
that any values of the coordinates and momenta at the time t' shall 
give the function F' the same value as the corresponding coordinates 
and momenta at the time t" give to F". Therefore F" may be 
derived from F' by substituting for p \, ... q' n their values in terms 
of p", ... g". Now we have approximately 


?! - U = 4§(pf - p") + • • • + Sm - «). 


dP” 


dQ 


~ Q ” = ||w - ^ + 


dQ_ 
dQ'y Qn ^ n)l 


(55) 


and as in F" terms of higher degree than the second are to be ne¬ 
glected, these equations may be considered accurate for the purpose 
of the transformation required. Since by equation (33) the eliminant 
of these equations has the value unity, the discriminant of F" will be 
equal to that of F', as has already appeared from the consideration 
of the principle of conservation of probability of phase, which is, in 
fact, essentially the same as that expressed by equation (33). 

At the time t', the phases satisfying the equation 


F' 


k, 


(56) 


where k is any positive constant, have the probability-coefficient 
Ce~ k . At the time t", the corresponding phases satisfy the equation 

F" = k, (57) 


and have the same probability-coefficient. So also the phases 
within the limits given by one or the other of these equations 
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are corresponding phases, and have probability-coefficients greater 
than Ce~ k , while phases without these limits have less probability- 
coefficients. The probability that the phase at the time t' falls 
within the limits F" — k is the same as the probability that it falls 
within the limits F" — k at the time t", since either event necessi¬ 
tates the other. This probability may be evaluated as follows. We 
may omit the accents, as we need only consider a single time. Let 
us denote the extension-in-phase within the limits F = k by U, 
and the probability that the phase falls within these limits by R , 
also the extension-in-phase within the limits F — 1 by U\. We have 
then by definition 


U = 



■ ■ ■ dq n , 


(58) 


cF=k 


R = 


Ce k dp\ ... dq n , 


f*F=l 


Ui = 


dpi . . . dq n . 


(59) 


(60) 


But since F is a homogeneous quadratic function of the differences 
pi-Pi, p 2 - P 2 , ••., q n ~Qn , 

we have identically 

rF=k r 


d{jp i Pi) • • • d{q n Qn ) 

rkF=k 


k n d(pi — Pi)... d(q n — Q n ) 


(*F= 1 


= k r ‘ 


d(pi - Pi) ■ ■ ■ d(q n - Qn). 
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That is 

whence 


U = k n U x , 


dU = Umk ri “ 1 dk. 

But if k varies, equations (58) and (59) give 


dU = 


dR = 


nF=k-\-dk 

’ F=k 
pF=k-\-dk 

IF=k 


dpi ■ . ■ dq n , 


Ce F dpi... dq n . 


(61) 

(62) 


(63) 

(64) 


Since the factor Ce f has the constant value Ce k in the last 
multiple integral, we have 


dR = Ce~ k dU = CUme^k 11 - 1 dk, 

( k 2 fc n_1 \ 

R = -CUin\e~ k ( 1 + k + — H-b 7-—7 ) + const. 

V 2 


(65) 

( 66 ) 


We may determine the constant of integration by the condition that 
R vanishes with k. This gives 

/ r.2 un-l \ 

R = CUin\ - CUin\e~ k 1 + k + — + ••• + 7 -— . (67) 

V 2 \n-l)\J 


We may determine the value of the constant U\ by the condition 
that R — 1 for k = 00 . This gives CUpnl = 1, and 


, ;„2 


u = 


k n 

CC.' 


( 68 ) 

(69) 
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It is worthy of notice that the form of these equations depends 
only on the number of degrees of freedom of the system, being in 
other respects independent of its dynamical nature, except that the 
forces must be functions of the coordinates either alone or with the 
time. 

If we write 



for the value of k which substituted in equation (68) will give R= 
the phases determined by the equation 

F = k E .,_ (70) 

will have the following properties. 

The probability that the phase falls within the limits formed by 
these phases is greater than the probability that it falls within any 
other limits enclosing an equal extension-in-phase. It is equal to the 
probability that the phase falls without the same limits. 

These properties are analogous to those which in the theory of 
errors in the determination of a single quantity belong to values 
expressed by A ± a, when A is the most probable value, and a the 
‘probable error.’ 


III. 


APPLICATION OF THE PRINCIPLE OF CONSERVATION OF 
EXTENSION-IN-PHASE TO THE INTEGRATION OF THE 
DIFFERENTIAL EQUATIONS OF MOTION.* 

We have seen that the principle of conservation of extension- 
in-phase may be expressed as a differential relation between the 
coordinates and momenta and the arbitrary constants of the inte¬ 
gral equations of motion. Now the integration of the differential 
equations of motion consists in the determination of these constants 
as functions of the coordinates and momenta with the time, and 
the relation afforded by the principle of conservation of extension- 
in-phase may assist us in this determination. 

It will be convenient to have a notation which shall not distin¬ 
guish between the coordinates and momenta. If we write rq, ... r 2n 
for the coordinates and momenta, and a, ... h as before for the ar¬ 
bitrary constants, the principle of which we wish to avail ourselves, 
and which is expressed by equation (37), may be written 

---^ = func ,(a,...h). (71) 

Let us first consider the case in which the forces are determined 
by the coordinates alone. Whether the forces are ‘conservative’ 
or not is immaterial. Since the differential equations of motion 
do not contain the time (t) in the finite form, if we eliminate dt 
from these equations, we obtain 2n — 1 equations in rq, ... r 2n and 
their differentials, the integration of which will introduce 2 n — 1 

*See Boltzmann: “Zusammenhang zwischen den Satzen liber das Verhalten 
mehratomiger Gasmoleklile mit Jacobi’s Princip des letzten Multiplicators.” 
Sitzb. der Wiener Akad., Bd. LXIII, Abth. II., S. 679, (1871). 
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arbitrary constants which we shall call b, ... h. If we can effect these 
integrations, the remaining constant (a) will then be introduced in 
the final integration, (viz., that of an equation containing dt,) and 
will be added to or subtracted from t in the integral equation. Let 
us have it subtracted from t. It is evident then that 


dri . dr 2 

da ’ da 


-r 2 , etc. 


(72) 


Moreover, since b, ... h and t — a are independent functions of 
ri, ...r 2n , the latter variables are functions of the former. The 
Jacobian in (71) is therefore function of b, ... h, and t — a, and since 
it does not vary with t it cannot vary with a. We have therefore 
in the case considered, viz., where the forces are functions of the 
coordinates alone, 


d(n ,.. .r 2n ) 
d(a,. . . h) 


func.(6 ,... h). 


(73) 


Now let us suppose that of the first 2n — 1 integrations we have 
accomplished all but one, determining 2n — 2 arbitrary constants 
(say c, ... h) as functions of rq, ... r 2n , leaving b as well as a to be 
determined. Our 2 n — 2 finite equations enable us to regard all the 
variables ri, ... r 2n , and all functions of these variables as functions 
of two of them, (say r\ and r 2 ,) with the arbitrary constants c, ... h. 
To determine b, we have the following equations for constant values 
of c, ... h. 


dri 

dr 2 


dr i 
da 
dr 2 

da 


da T 
da T 


dr i 
~db 
dr 2 

~db 


db, 
db , 
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whence 


d(r 1 ,r 2 ) 
d(a , b ) 


dr 2 

da 


dr i + 


dri 

da 


dr 2 . 


Now, by the ordinary formula for the change of variables, 


(74) 


d{r 1 ,r 2 ) 
d(a } b) 


da db dr 3 ... dr 2n = 


dri... dr 2n 


da... dh 


d(n,. ..r 2n ) 
d(a,. .. h) 
d(r 1 ,...r 2n ) d{c,...h) 


d(a,...h ) d(r 3 ,...r 2n ) 


da db dr 3 


■ ■ dr 2n, 


where the limits of the multiple integrals are formed by the same 
phases. Hence 


d{r i, r 2 ) _ d(r 1 ,...r 2n ) d(c, ...h) 
d(a, b) d(a, ...h) d(r 3 ,... r 2n ) ’ 

With the aid of this equation, which is an identity, and (72), we 
may write equation (74) in the form 


,...r 2n ) d(c,...h ) ( > 

-tt—-- db — r 2 dr x - r 3 dr 2 . (76) 

d(r 3 ,... r 2n ) 

The separation of the variables is now easy. The differential 
equations of motion give f\ and r 2 in terms of r 3 , ... r 2n . The in¬ 
tegral equations already obtained give c, ... h and therefore the Ja¬ 
cobian d(c,... h)/d(r 3 ,... r 2n ), in terms of the same variables. But 
in virtue of these same integral equations, we may regard functions 
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of rr, ... r 2n as functions of rq and r 2 with the constants c, ... h. If 
therefore we write the equation in the form 


d(r i,.. .r 2n ) 

d(a,... h) 


A 

d(c,...h ) ?1 

d(r 3 ,.. .r 2 „) 


d(c,...h) dr21 
d(r 3 ,... r 2n ) 


(77) 


the coefficients of drq and dr 2 may be regarded as known functions 
of rq and r 2 with the constants c, ... /r. The coefficient of db is 
by (73) a function of b, ... h. It is not indeed a known function 
of these quantities, but since c, ... h are regarded as constant in 
the equation, we know that the first member must represent the 
differential of some function of b, ... /r, for which we may write b'. 
We have thus 


db' 


d(c,...h ) dVl d(c, ...h) dr2 ’ 
d(r 3 ,...r 2n ) d(r 3 ,... r 2n ) 


(78) 


which may be integrated by quadratures and gives b" as functions 
of r 3 , r 2 , ... c, ... h, and thus as function of r 3 , ... r 2n . 

This integration gives us the last of the arbitrary constants which 
are functions of the cobrdinates and momenta without the time. 
The final integration, which introduces the remaining constant (a), 
is also a quadrature, since the equation to be integrated may be 
expressed in the form 


dt = F(r \) dr x . 

Now, apart from any such considerations as have been adduced, 
if we limit ourselves to the changes which take place in time, we 
have identically 


r 2 dr 3 — fq dr 2 = 0, 
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and fq and r 2 are given in terms of 7q, ... r 2n by the differential 
equations of motion. When we have obtained 2n — 2 integral equa¬ 
tions, we may regard r 2 and fq as known functions of rq and r 2 . The 
only remaining difficulty is in integrating this equation. If the case 
is so simple as to present no difficulty, or if we have the skill or the 
good fortune to perceive that the multiplier 


1 

d(c,... h) ’ 
d(r 3 ,... r 2 „) 


(79) 


or any other, will make the first member of the equation an exact 
differential, we have no need of the rather lengthy considerations 
which have been adduced. The utility of the principle of conser¬ 
vation of extension-in-phase is that it supplies a ‘multiplier’ which 
renders the equation integrable, and which it might be difficult or 
impossible to find otherwise. 

It will be observed that the function represented by b' is a partic¬ 
ular case of that represented by b. The system of arbitrary constants 
a, b', c, ... h has certain properties notable for simplicity. If we write 
b' for b in (77), and compare the result with (78), we get 


d(r i,...r 2n ) 
d(a, b\ c,... h ) 

Therefore the multiple integral 


(80) 


/ 


■ dadb' dc... dh 


(81) 


taken within limits formed by phases regarded as contemporaneous 
represents the extension-in-phase within those limits. 
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The case is somewhat different when the forces are not deter¬ 
mined by the coordinates alone, but are functions of the coordinates 
with the time. All the arbitrary constants of the integral equations 
must then be regarded in the general case as functions of rr, ... r 2 n , 
and t. We cannot use the principle of conservation of extension- 
in-phase until we have made 2 n — 1 integrations. Let us suppose 
that the constants b, ... h have been determined by integration in 
terms of rr, .. . r 2n , and t, leaving a single constant (a) to be thus 
determined. Our 2 n — 1 finite equations enable us to regard all the 
variables rq, ... r 2n as functions of a single one, say rq. 

For constant values of b, ... h, we have 


Now 


dr i 


dr i 
da 


da + fq dt. 


(82) 



da dr 2 ... dr 2n = 


dri . . . dr 2n 


d(a,.. . h) 

d(rq,... r 2n ) d(b,...h ) 


d(a,...h ) d(r 2 ,... r 2n ) 


da dr 2 ... dr 


2m 


where the limits of the integrals are formed by the same phases. We 
have therefore 


dr i _ d(r q,... r 2n ) d{b , ...h) 
da d(a,... h ) d(r 2 ,... r 2n ) ’ 

by which equation (82) may be reduced to the form 

d(r 1 ,...r 2n ) _ 1 fq 

d(a, ...h) ~ d(b , ...h) 1 d(b, ...h) 

d(r 2 ,...r 2n ) d(r 2 ,...r 2n ) 


(83) 


dt. 


(84) 
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Now we know by (71) that the coefficient of da is a function 
of a, ... h. Therefore, as b , ... h are regarded as constant in the 
equation, the first number represents the differential of a function 
of a, ... h, which we may denote by a'. We have then 


da' 


_ dr _—_ dt 

d(b,...h ) 1 d(b,...h ) ’ 

d(r 2 , ■ ■ ■ r 2n ) d(r 2 ,... r 2n ) 


(85) 


which may be integrated by quadratures. In this case we may say 
that the principle of conservation of extension-in-phase has supplied 
the ‘multiplier’ 


d(b,...h ) dTl 
d(r 2l . ..r 2n ) 


( 86 ) 


for the integration of the equation 


dr i — f\ dt = 0. 


(87) 


The system of arbitrary constants a', b, ... h has evidently the 
same properties which were noticed in regard to the system a, 

b', ... h. 








IV. 


ON THE DISTRIBUTION IN PHASE CALLED CANONICAL, 
IN WHICH THE INDEX OF PROBABILITY IS A LINEAR 
FUNCTION OF THE ENERGY. 


Let us now give our attention to the statistical equilibrium of en¬ 
sembles of conservation systems, especially to those cases and prop¬ 
erties which promise to throw light on the phenomena of thermo¬ 
dynamics. 

The condition of statistical equilibrium may be expressed in the 
form* 



where P is the coefficient of probability, or the quotient of the 
density-in-phase by the whole number of systems. To satisfy this 
condition, it is necessary and sufficient that P should be a function 
of the p’s and g’s (the momenta and coordinates) which does not 
vary with the time in a moving system. In all cases which we are 
now considering, the energy, or any function of the energy, is such 
a function. 

P = func. (e) 

will therefore satisfy the equation, as indeed appears identically if 
we write it in the form 



dP de 

dpi dpi 


dP de \ 

dqi dip J 


0 . 


*See equations (20), (41), (42), also the paragraph following equation (20). 
The positions of any external bodies which can affect the systems are here 
supposed uniform for all the systems and constant in time. 
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There are, however, other conditions to which P is subject, 
which are not so much conditions of statistical equilibrium, as con¬ 
ditions implicitly involved in the definition of the coefficient of prob¬ 
ability, whether the case is one of equilibrium or not. These are: 
that P should be single-valued, and neither negative nor imaginary 
for any phase, and that expressed by equation (46), viz., 

[ - a - U - f P dpi . . . dq n = 1. (89) 

J phases J 

These considerations exclude 

P = e x constant, 


as well as 

P = constant, 

as cases to be considered. 

The distribution represented by 


77 = logP = —q—, (90) 

or 

ip—e 

P = e~@-, (91) 

where 0 and if} are constants, and 0 positive, seems to represent the 
most simple case conceivable, since it has the property that when 
the system consists of parts with separate energies, the laws of the 
distribution in phase of the separate parts are of the same nature,— 
a property which enormously simplifies the discussion, and is the 
foundation of extremely important relations to thermodynamics. 
The case is not rendered less simple by the divisor 0, (a quantity 
of the same dimensions as e,) but the reverse, since it makes the 
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distribution independent of the units employed. The negative sign 
of e is required by (89), which determines also the value of ^ for 
any given 0, viz., 

e e = / ■ ■ ■ / e © dpi... dq n . (92) 

J phases J 

When an ensemble of systems is distributed in phase in the man¬ 
ner described, i.e., when the index of probability is a linear func¬ 
tion of the energy, we shall say that the ensemble is canonically 
distributed , and shall call the divisor of the energy (0) the modulus 
of distribution. 

The fractional part of an ensemble canonically distributed which 
lies within any given limits of phase is therefore represented by the 
multiple integral 

ijj—e 

e © dpi... dq n (93) 

taken within those limits. We may express the same thing by saying 
that the multiple integral expresses the probability that an unspec¬ 
ified system of the ensemble (i.e., one of which we only know that 
it belongs to the ensemble) falls within the given limits. 

Since the value of a multiple integral of the form (23) (which we 
have called an extension-in-phase) bounded by any given phases is 
independent of the system of coordinates by which it is evaluated, 
the same must be true of the multiple integral in (92), as appears 
at once if we divide up this integral into parts so small that the 
exponential factor may be regarded as constant in each. The value 
of is therefore independent of the system of coordinates employed. 

ft is evident that ijj might be defined as the energy for which the 
coefficient of probability of phase has the value unity. Since however 
this coefficient has the dimensions of the inverse nth power of the 
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product of energy and time,* the energy represented by i/j is not 
independent of the units of energy and time. But when these units 
have been chosen, the definition of tfj will involve the same arbitrary 
constant as e, so that, while in any given case the numerical values 
of if} or e will be entirely indefinite until the zero of energy has 
also been fixed for the system considered, the difference ^ — e will 
represent a perfectly definite amount of energy, which is entirely 
independent of the zero of energy which we may choose to adopt. 

ft is evident that the canonical distribution is entirely deter¬ 
mined by the modulus (considered as a quantity of energy) and the 
nature of the system considered, since when equation (92) is sat¬ 
isfied the value of the multiple integral (93) is independent of the 
units and of the coordinates employed, and of the zero chosen for 
the energy of the system. 

In treating of the canonical distribution, we shall always sup¬ 
pose the multiple integral in equation (92) to have a finite value, as 
otherwise the coefficient of probability vanishes, and the law of dis¬ 
tribution becomes illusory. This will exclude certain cases, but not 
such apparently, as will affect the value of our results with respect 
to their bearing on thermodynamics. It will exclude, for instance, 
cases in which the system or parts of it can be distributed in un¬ 
limited space (or in a space which has limits, but is still infinite in 
volume), while the energy remains beneath a finite limit. It also ex¬ 
cludes many cases in which the energy can decrease without limit, 
as when the system contains material points which attract one an¬ 
other inversely as the squares of their distances. Cases of material 
points attracting each other inversely as the distances would be ex¬ 
cluded for some values of 0, and not for others. The investigation of 
such points is best left to the particular cases. For the purposes of a 


See Chapter I, p. 20. 
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general discussion, it is sufficient to call attention to the assumption 
implicitly involved in the formula (92).* 

The modulus 0 has properties analogous to those of temperature 
in thermodynamics. Let the system A be defined as one of an 
ensemble of systems of m degrees of freedom distributed in phase 
with a probability-coefficient 

4’a-^a 
e e , 

and the system B as one of an ensemble of systems of n degrees of 
freedom distributed in phase with a probability-coefficient 

e e , 

which has the same modulus. Let q \, ... q m , p \, ...p m be the 
coordinates and momenta of A, and q m+ i, ... q m+n , p m + i, • • • Pm+n 
those of B. Now we may regard the systems A and B as together 
forming a system C, having m + n degrees of freedom, and the 
coordinates and momenta q±, ... q m + n , Pi, ■ ■ - Pm+n- The probabil¬ 
ity that the phase of the system C, as thus defined, will fall within 
the limits 

dpi, ... dp m+ni dqi, ... dq m+n 

is evidently the product of the probabilities that the systems A and 
B will each fall within the specified limits, viz., 

Pa+4'b-£a-£b 

e e dp\ ... dp m+n dq { ... dq m+n . (94) 

*It will be observed that similar limitations exist in thermodynamics. In 
order that a mass of gas can be in thermodynamic equilibrium, it is necessary 
that it be enclosed. There is no thermodynamic equilibrium of a (finite) mass of 
gas in an infinite space. Again, that two attracting particles should be able to do 
an infinite amount of work in passing from one configuration (which is regarded 
as possible) to another, is a notion which, although perfectly intelligible in a 
mathematical formula, is quite foreign to our ordinary conceptions of matter. 
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We may therefore regard C as an undetermined system of an en¬ 
semble distributed with the probability-coefficient 

■•Pa+^b-^a+^b) 

e © , (95) 

an ensemble which might be defined as formed by combining each 
system of the first ensemble with each of the second. But since 
ca + Cb is the energy of the whole system, and ip a and ips are 
constants, the probability-coefficient is of the general form which we 
are considering, and the ensemble to which it relates is in statistical 
equilibrium and is canonically distributed. 

This result, however, so far as statistical equilibrium is con¬ 
cerned, is rather nugatory, since conceiving of separate systems as 
forming a single system does not create any interaction between 
them, and if the systems combined belong to ensembles in statistical 
equilibrium, to say that the ensemble formed by such combinations 
as we have supposed is in statistical equilibrium, is only to repeat 
the data in different words. Let us therefore suppose that in form¬ 
ing the system C we add certain forces acting between A and B, 
and having the force-function —6ab- The energy of the system C 
is now 6a + cb + £ab, and an ensemble of such systems distributed 
with a density proportional to 

-(p+ c B+fAB) 

e © (96) 

would be in statistical equilibrium. Comparing this with the 
probability-coefficient of C given above (95), we see that if we 
suppose 6ab (or rather the variable part of this term when we 
consider all possible configurations of the systems A and B) to be 
infinitely small, the actual distribution in phase of C will differ 
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infinitely little from one of statistical equilibrium, which is equiva¬ 
lent to saying that its distribution in phase will vary infinitely little 
even in a time indefinitely prolonged.* The case would be entirely 
different if A and B belonged to ensembles having different moduli, 
say ©a and 0^. The probability-coefficient of C would then be 

i>A-eA 1pB-£B 

e e A + e B } (97) 

which is not approximately proportional to any expression of the 
form (96). 

Before proceeding farther in the investigation of the distribution 
in phase which we have called canonical, it will be interesting to see 
whether the properties with respect to statistical equilibrium which 
have been described are peculiar to it, or whether other distributions 
may have analogous properties. 

Let rf and rf be the indices of probability in two independent 
ensembles which are each in statistical equilibrium, then rf + rf will 
be the index in the ensemble obtained by combining each system 
of the first ensemble with each system of the second. This third 

*It will be observed that the above condition relating to the forces which act 
between the different systems is entirely analogous to that which must hold in 
the corresponding case in thermodynamics. The most simple test of the equality 
of temperature of two bodies is that they remain in equilibrium when brought 
into thermal contact. Direct thermal contact implies molecular forces acting 
between the bodies. Now the test will fail unless the energy of these forces can 
be neglected in comparison with the other energies of the bodies. Thus, in the 
case of energetic chemical action between the bodies, or when the number of 
particles affected by the forces acting between the bodies is not negligible in 
comparison with the whole number of particles (as when the bodies have the 
form of exceedingly thin sheets), the contact of bodies of the same temperature 
may produce considerable thermal disturbance, and thus fail to afford a reliable 
criterion of the equality of temperature. 
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ensemble will of course be in statistical equilibrium, and the function 
of phase rj'+rj" will be a constant of motion. Now when infinitesimal 
forces are added to the compound systems, if rf + rf or a function 
differing infinitesimally from this is still a constant of motion, it 
must be on account of the nature of the forces added, or if their 
action is not entirely specified, on account of conditions to which 
they are subject. Thus, in the case already considered, rj' + rj" is a 
function of the energy of the compound system, and the infinitesimal 
forces added are subject to the law of conservation of energy. 

Another natural supposition in regard to the added forces is that 
they should be such as not to affect the moments of momentum of 
the compound system. To get a case in which moments of mo¬ 
mentum of the compound system shall be constants of motion, we 
may imagine material particles contained in two concentric spheri¬ 
cal shells, being prevented from passing the surfaces bounding the 
shells by repulsions acting always in lines passing through the com¬ 
mon centre of the shells. Then, if there are no forces acting between 
particles in different shells, the mass of particles in each shell will 
have, besides its energy, the moments of momentum about three 
axes through the centre as constants of motion. 

Now let us imagine an ensemble formed by distributing in phase 
the system of particles in one shell according to the index of prob¬ 
ability 


0 + n 2 + Os ’ 


(98) 


where e denotes the energy of the system, and ur, uj 2 , 0 ) 3 , its three 
moments of momentum, and the other letters constants. In like 
manner let us imagine a second ensemble formed by distributing 
in phase the system of particles in the other shell according to the 
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index 


,, e' co'r, 

A ~e + ch + a 2 + cu’ 


(99) 


where the letters have similar significations, and 0, 0], 0 2 , ^3 the 
same values as in the preceding formula. Each of the two ensembles 
will evidently be in statistical equilibrium, and therefore also the 
ensemble of compound systems obtained by combining each system 
of the first ensemble with each of the second. In this third ensemble 
the index of probability will be 


A + A' - 


e + e' | ui i + 

© + nl 


+ 


0J2 + C^2 


+ 


UJ3 + CJ3 

h2 3 


( 100 ) 


where the four numerators represent functions of phase which are 
constants of motion for the compound systems. 

Now if we add in each system of this third ensemble infinitesimal 
conservative forces of attraction or repulsion between particles in 
different shells, determined by the same law for all the systems, the 
functions oq + cu 2 + cu 2 , and cu 3 + cu 3 will remain constants of 
motion, and a function differing infinitely little from + e' will be a 
constant of motion. It would therefore require only an infinitesimal 
change in the distribution in phase of the ensemble of compound 
systems to make it a case of statistical equilibrium. These properties 
are entirely analogous to those of canonical ensembles.* 

*It would not be possible to omit the term relating to energy in the above 
indices, since without this term the condition expressed by equation (89) cannot 
be satisfied. 

The consideration of the above case of statistical equilibrium may be made the 
foundation of the theory of the thermodynamic equilibrium of rotating bodies,— 
a subject which has been treated by Maxwell in his memoir “On Boltzmann’s 
theorem on the average distribution of energy in a system of material points.” 
Cambr. Phil. Trans., vol. XII, p. 547, (1878). 
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Again, if the relations between the forces and the coordinates 
can be expressed by linear equations, there will be certain “normal” 
types of vibration of which the actual motion may be regarded as 
composed, and the whole energy may be divided into parts relating 
separately to vibrations of these different types. These partial ener¬ 
gies will be constants of motion, and if such a system is distributed 
according to an index which is any function of the partial energies, 
the ensemble will be in statistical equilibrium. Let the index be a 
linear function of the partial energies, say 


A-^~ 


( 101 ) 


Let us suppose that we have also a second ensemble composed of 
systems in which the forces are linear functions of the coordinates, 
and distributed in phase according to an index which is a linear 
function of the partial energies relating to the normal types of vi¬ 
bration, say 

A '~w - 4 - ( 102 ) 

Since the two ensembles are both in statistical equilibrium, the 
ensemble formed by combining each system of the first with each 
system of the second will also be in statistical equilibrium. Its 
distribution in phase will be represented by the index 


A + A' - -5- 


(!03) 


and the partial energies represented by the numerators in the for¬ 
mula will be constants of motion of the compound systems which 
form this third ensemble. 

Now if we add to these compound systems infinitesimal forces 
acting between the component systems and subject to the same 
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general law as those already existing, viz., that they are conserva¬ 
tive and linear functions of the coordinates, there will still be n + m 
types of normal vibration, and n + m partial energies which are 
independent constants of motion. If all the original n + m normal 
types of vibration have different periods, the new types of normal 
vibration will differ infinitesimally from the old, and the new par¬ 
tial energies, which are constants of motion, will be nearly the same 
functions of phase as the old. Therefore the distribution in phase 
of the ensemble of compound systems after the addition of the sup¬ 
posed infinitesimal forces will differ infinitesimally from one which 
would be in statistical equilibrium. 

The case is not so simple when some of the normal types of mo¬ 
tion have the same periods. In this case the addition of infinitesimal 
forces may completely change the normal types of motion. But the 
sum of the partial energies for all the original types of vibration 
which have any same period, will be nearly identical (as a function 
of phase, i.e., of the coordinates and momenta,) with the sum of 
the partial energies for the normal types of vibration which have 
the same, or nearly the same, period after the addition of the new 
forces. If, therefore, the partial energies in the indices of the first 
two ensembles (101) and (102) which relate to types of vibration 
having the same periods, have the same divisors, the same will 
be true of the index (103) of the ensemble of compound systems, 
and the distribution represented will differ infinitesimally from one 
which would be in statistical equilibrium after the addition of the 
new forces.* 

*It is interesting to compare the above relations with the laws respecting 
the exchange of energy between bodies by radiation, although the phenomena 
of radiations lie entirely without the scope of the present treatise, in which the 
discussion is limited to systems of a finite number of degrees of freedom. 
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The same would be true if in the indices of each of the original 
ensembles we should substitute for the term or terms relating to 
any period which does not occur in the other ensemble, any func¬ 
tion of the total energy related to that period, subject only to the 
general limitation expressed by equation(89). But in order that the 
ensemble of compound systems (with the added forces) shall always 
be approximately in statistical equilibrium, it is necessary that the 
indices of the original ensembles should be linear functions of those 
partial energies which relate to vibrations of periods common to 
the two ensembles, and that the coefficients of such partial energies 
should be the same in the two indices.* 

The properties of canonically distributed ensembles of systems 
with respect to the equilibrium of the new ensembles which may 
be formed by combining each system of one ensemble with each 
system of another, are therefore not peculiar to them in the sense 
that analogous properties do not belong to some other distributions 
under special limitations in regard to the systems and forces consid¬ 
ered. Yet the canonical distribution evidently constitutes the most 
simple case of the kind, and that for which the relations described 
hold with the least restrictions. 

Returning to the case of the canonical distribution, we shall find 
other analogies with thermodynamic systems, if we suppose, as in 
the preceding chapters,' that the potential energy (e q ) depends not 
only upon the coordinates q±, ... q n which determine the configura¬ 
tion of the system, but also upon certain coordinates cq, 02 , etc. of 

*The above may perhaps be sufficiently illustrated by the simple case where 
n = 1 in each system. If the periods are different in the two systems, they may 
be distributed according to any functions of the energies: but if the periods are 
the same they must be distributed canonically with same modulus in order that 
the compound ensemble with additional forces may be in statistical equilibrium. 

^See especially Chapter I, p. 2. 
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bodies which we call external, meaning by this simply that they are 
not to be regarded as forming any part of the system, although their 
positions affect the forces which act on the system. The forces ex¬ 
erted by the system upon these external bodies will be represented 
by —de q /dai, —dejda^, etc., while —de q /dqi, ... —de q /dq n represent 
all the forces acting upon the bodies of the system, including those 
which depend upon the position of the external bodies, as well as 
those which depend only upon the configuration of the system itself. 
It will be understood that e p depends only upon q\, ... q n pi, ... p n , 
in other words, that the kinetic energy of the bodies which we call 
external forms no part of the kinetic energy of the system. It follows 
that we may write 

de de q 

dai dai 

although a similar equation would not hold for differentiations rel¬ 
ative to the internal coordinates. 

We always suppose these external coordinates to have the same 
values for all systems of any ensemble. In the case of a canonical 
distribution, i.e., when the index of probability of phase is a linear 
function of the energy, it is evident that the values of the external 
coordinates will affect the distribution, since they affect the energy. 
In the equation 



_± r all r _e_ 

e © = / • • ■ / e © dp\ ... dq n , (105) 

J phases J 

by which i[} may be determined, the external coordinates, cq, < 22 , 
etc., contained implicitly in e, as well as 0, are to be regarded as 
constant in the integrations indicated. The equation indicates that 
ip is a function of these constants. If we imagine their values varied, 
and the ensemble distributed canonically according to their new 
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values, we have by differentiation of the equation, 

e-S (-1 # + J) d ej = de I ee-i dp , ... dq n 

1 , /' ail /' de _± 

- — dai / ■ ■ ■ / — e © dpi... dq n 

fe) J phases J CtCl \ 

1 , /“ail /' de_± . . 

——da 2 / ••• / -—e © dpi... dq n — etc., (106) 

0 J phases J da2 


or, multiplying by 0 e ©, and setting 


de de 

= ffi, = A 2 , etc., 

aai dci2 


, , l/j 1 ... /' all f 

— dip + — die) = — die) / • • • / ee © dp i... dq n 

0 0 J phases J 

f all /‘ 

+ da i / • • • / ! e © dpi . . . dq n 

J phases J 

f all f ^~ e 

+ d/i ‘2 ■ ■ ■ A 2 e ® dpi... dq n + etc. (107) 

J phases J 

Now the average value in the ensemble of any quantity (which we 
shall denote in general by a horizontal line above the proper symbol) 
is determined by the equation 


/‘ all [±zl 

«=/■•• I lie © dp i ... dq. n 

J phases J 


(108) 


Comparing this with the preceding equation, we have 

dil) = ^ d@ — — dO — A\ da\ — A 2 da 2 — etc. (109) 
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Or, since 


ip — e 

0 = * 

( 110 ) 

and 


ip — e 

0 =>h 

( 111 ) 


dip = 

fj d@ — A 1 dai — A 2 da -2 — etc. 

( 112 ) 

Moreover, since (111) gives 




dip — de = Q dr} + r/ d@, 

(113) 

we have also 

de = 

-0 dfj — A\ da\ — A 2 da 2 — etc. 

(114) 


This equation, if we neglect the sign of averages, is identical in 
form with the thermodynamic equation 


dc + dd\ + A .2 d &2 etc. 

dr]= T 

(115) 

de = T di} — Ai dai — A 2 da 2 — etc. 

(116) 


which expresses the relation between the energy, temperature, and 
entropy of a body in thermodynamic equilibrium, and the forces 
which it exerts on external bodies,—a relation which is the math¬ 
ematical expression of the second law of thermodynamics for re¬ 
versible changes. The modulus in the statistical equation corre¬ 
sponds to temperature in the thermodynamic equation, and the 
average index of probability with its sign reversed corresponds to 
entropy. Bnt in the thermodynamic equation the entropy (r/) is a 
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quantity which is only defined by the equation itself, and incom¬ 
pletely defined in that the equation only determines its differential, 
and the constant of integration is arbitrary. On the other hand, 
the fj in the statistical equation has been completely defined as the 
average value in a canonical ensemble of systems of the logarithm 
of the coefficient of probability of phase. 

We may also compare equation (112) with the thermodynamic 
equation 

dil> = —r] clT — Ai da\ — A 2 da 2 — etc., (117) 

where ip represents the function obtained by subtracting the product 
of the temperature and entropy from the energy. 

How far, or in what sense, the similarity of these equations con¬ 
stitutes any demonstration of the thermodynamic equations, or ac¬ 
counts for the behavior of material systems, as described in the the¬ 
orems of thermodynamics, is a question of which we shall postpone 
the consideration until we have further investigated the properties 
of an ensemble of systems distributed in phase according to the law 
which we are considering. The analogies which have been pointed 
out will at least supply the motive for this investigation, which will 
naturally commence with the determination of the average values 
in the ensemble of the most important quantities relating to the 
systems, and to the distribution of the ensemble with respect to the 
different values of these quantities. 


V. 


AVERAGE VALUES IN A CANONICAL ENSEMBLE OF 

SYSTEMS. 

In the simple but important case of a system of material points, 
if we use rectangular coordinates, we have for the product of the 
differentials of the coordinates 

dx i dy\ dz\... dx v dy v dz u , 

and for the product of the differentials of the momenta 

m i dx i rrii dij\ rn\ dzi... rn v dx u rn u dy u m v dz u . 

The product of these expressions, which represents an element of 
extension-in-phase, may be briefly written 

m i dx i ... rn v dz v dx\ ... dz v ; 


and the integral 

'Ip—E 

e © mi dxi... m u dz u dx\... dz v (118) 

will represent the probability that a system taken at random from 
an ensemble canonically distributed will fall within any given limits 
of phase. 

In this case 



e = e q + \mi±\ H-b \m v x 2 v , (119) 


-*/>—£ j>—eg mixf m u x1 

e © = e © 2 © ... e~ 2 © . 


and 


(120) 
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The potential energy (e q ) is independent of the velocities, and if 
the limits of integration for the coordinates are independent of the 
velocities, and the limits of the several velocities are independent of 
each other as well as of the coordinates, the multiple integral may 
be resolved into the product of integrals 


e © dx i. 


m\x\ 


dz u e 2 © midxi 


m u z ~ 

e~ 2 © 77i,, dz v 


(121) 

This shows that the probability that the configuration lies within 
any given limits is independent of the velocities, and that the prob¬ 
ability that any component velocity lies within any given limits is 
independent of the other component velocities and of the configu¬ 
ration. 

Since 

/ °° mixf _ 

e~ 2 © rriidii = y27rmi©, 


and 


m\x^ j - 

\m\x\ e~ 2 © m\dx\ = J |7rmi@ 3 , 


( 122 ) 


(123) 


the average value of the part of the kinetic energy clue to the veloc¬ 
ity d'i, which is expressed by the quotient of these integrals, is |0. 
This is true whether the average is taken for the whole ensemble or 
for any particular configuration, whether it is taken without refer¬ 
ence to the other component velocities, or only those systems are 
considered in which the other component velocities have particular 
values or lie within specified limits. 

The number of coordinates is or n. We have, therefore, for 
the average value of the kinetic energy of a system 


e p = |z/0 = \nQ. 


(124) 
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This is equally true whether we take the average for the whole en¬ 
semble, or limit the average to a single configuration. 

The distribution of the systems with respect to their component 
velocities follows the ‘law of errors’; the probability that the value 
of any component velocity lies within any given limits being repre¬ 
sented by the value of the corresponding integral in (121) for those 
limits, divided by ( 27 rm@) 2 , which is the value of the same integral 
for infinite limits. Thus the probability that the value of aq lies 
between any given limits is expressed by 



(125) 


The expression becomes more simple when the velocity is expressed 
with reference to the energy involved. If we set 



the probability that s lies between any given limits is expressed by 



(126) 


Here s is the ratio of the component velocity to that which would 
give the energy 0; in other words, s 2 is the quotient of the energy 
due to the component velocity divided by 0. The distribution with 
respect to the partial energies due to the component velocities is 
therefore the same for all the component velocities. 

The probability that the configuration lies within any given lim¬ 
its is expressed by the value of 



(127) 
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for those limits, where M denotes the product of all the masses. 
This is derived from (121) by substitution of the values of the inte¬ 
grals relating to velocities taken for infinite limits. 

Very similar results may be obtained in the general case of a 
conservative system of n degrees of freedom. Since e p is a homoge¬ 
neous quadratic function of the p’s, it may be divided into parts by 
the formula 

ep = bl ^ + ... + bn ^ ( 128) 
where e might be written for e p in the differential coefficients without 
affecting the signification. The average value of the first of these 
parts, for any given configuration, is expressed by the quotient 



i de ±zl , 
oPi ~T~ e 0 dp 1... dp n 
dpi 


'ip—e 

e~e~ 


dpi... dp n 


(129) 


Now we have by integration by parts 


de , _ 

Pie © - dpi = 0 

dpi 


f°° p-e 

e © dpi. 


(130) 


By substitution of this value, the above quotient reduces to ^0, 
which is therefore the average value of \pi for the given con- 

- dpi 

figuration. Since this value is independent of the configuration, it 
must also be the average for the whole ensemble, as might easily be 
proved directly. (To make the preceding proof apply directly to the 
whole ensemble, we have only to write dpi... dq n for dpi . .. dp n in 
the multiple integrals.) This gives \ n@ for the average value of the 
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whole kinetic energy for any given configuration, or for the whole 
ensemble, as has already been proved in the case of material points. 

The mechanical significance of the several parts into which the 
kinetic energy is divided in equation (128) will be apparent if we 
imagine that by the application of suitable forces (different from 
those derived from e q and so much greater that the latter may be 
neglected in comparison) the system was brought from rest to the 
state of motion considered, so rapidly that the configuration was not 
sensibly altered during the process, and in such a manner also that 
the ratios of the component velocities were constant in the process. 
If we write 

Fi dqi + ■ ■ ■ + F n dq n 

for the moment of these forces, we have for the period of their action 
by equation (3) 


■ _ ( K de <i , p _ <*e 

Pl — ---h Pi — - r r 

dqi dqi dq\ 

The work done by the force F\ may be evaluated as follows: 

J F, dqi = I pi dqi + J dq u 


where the last term may be cancelled because the configuration does 
not vary sensibly during the application of the forces. (It will be 
observed that the other terms contain factors which increase as the 
time of the action of the forces is diminished.) We have therefore, 


Fi dqi = / piqi dt = 


q\ dpi = — 
Pi 


pi dpi. 


(131) 


For since the p 's are linear functions of the q 's (with coefficients 
involving the g’s) the supposed constancy of the q's and of the ratios 
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of the q 1 s will make the ratio q\jp\ constant. The last integral is 
evidently to be taken between the limits zero and the value of p\ 
in the phase originally considered, and the quantities before the 
integral sign may be taken as relating to that phase. We have 
therefore 

/ dc 

F\ dqi = \pxqi = \ Pl -^-. (132) 

That is: the several parts into which the kinetic energy is divided 
in equation (128) represent the amounts of energy communicated 
to the system by the several forces F \, ... F n under the conditions 
mentioned. 

The following transformation will not only give the value of the 
average kinetic energy, but will also serve to separate the distribu¬ 
tion of the ensemble in configuration from its distribution in velocity. 

Since 2e p is a homogeneous quadratic function of the p’s, which 
is incapable of a negative value, it can always be expressed (and 
in more than one way) as a sum of squares of linear functions of 
the p’s.* The coefficients in these linear functions, like those in the 
quadratic function, must be regarded in the general case as functions 
of the g’s. Let 

2e p = u± + U 2 + • • • + u n (133) 

where u±, .. .u n are such linear functions of the p’s. If we write 

djjPl ■■■Pn) 
d(yXl\ ... u n ) 


for the Jacobian or determinant of the differential coefficients of the 

*The reduction requires only the repeated application of the process of 
‘completing the square’ used in the solution of quadratic equations. 
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form dp/du, we may substitute 

•• -Pn) 

...u n 
for 

dp] ■ ■ ■ dp n 

under the multiple integral sign in any of onr formulae, ft will be 
observed that this determinant is function of the q 's alone. The sign 
of such a determinant depends on the relative order of the variables 
in the numerator and denominator. But since the suffixes of the u’s 
are only used to distinguish these functions from one another, and 
no especial relation is supposed between a p and a u which have the 
same suffix, we may evidently, without loss of generality, suppose 
the suffixes so applied that the determinant is positive. 

Since the u’s are linear functions of the p’s, when the integrations 
are to cover all values of the p’s (for constant g’s) once and only 
once, they must cover all values of the u’s once and only once, and 
the limits will be ±00 for all the u’s. Without the supposition of 
the last paragraph the upper limits would not always be + 00 , as is 
evident on considering the effect of changing the sign of a u. But 
with the supposition which we have made (that the determinant 
is always positive) we may make the upper limits +00 and the 
lower — 00 for all the u’s. Analogous considerations will apply where 
the integrations do not cover all values of the p’s and therefore of 
the u’s. The integrals may always be taken from a less to a greater 
value of a u. 

The general integral which expresses the fractional part of the 
ensemble which falls within any given limits of phase is thus reduced 
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to the form 



d(pi...Pn) c - " ?+ 2 ' e +U " 

d(u\ ... u n ^) 


du i . .. du n dqi... dq n . 


(134) 


For the average value of the part of the kinetic energy which 
is represented by whether the average is taken for the whole 
ensemble, or for a given configuration, we have therefore 


iw 2 
2 “1 


|n 2 e 2 © du i 



e 2 © 


( ^7T0 3 ) 2 
( 2710)5 


0 

2 "’ 


(135) 


and for the average of the whole kinetic energy, |n0, as before. 

The fractional part of the ensemble which lies within any given 
limits of configuration, is found by integrating (134) with respect to 
the id s from — oo to +oo. This gives 

(136) 

which shows that the value of the Jacobian is independent of the 
manner in which 2e p is divided into a sum of squares. We may 
verify this directly, and at the same time obtain a more convenient 
expression for the Jacobian, as follows. 

It will be observed that since the w’S are linear functions of 
the p’s, and the p’s linear functions of the q 1 s, the id s will be 
linear functions of the g’s, so that a differential coefficient of the 
form du/dq will be independent of the g’s, and function of the g’s 
alone. Let us write dp x /du y for the general element of the Jacobian 
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determinant. We have 


dPx 

dUy 


d de d de du r 

du y dq x du y du r dq x 



d 2 e du r 


du y du r dq 3 


d de 
dq x du y 


du y 

dq x ' 


(137) 


Therefore 

djjpi ■ ■ ■ p n ) _ djui ...u n ) 
d(v,\ . . . Un) d(q i...q n ) 

and 

/ d(p i ■ ■. p„) \ 2 _ f \ 2 _ d(pi ... p„) 

\d(u I ...u n )J \d(q 1 ...g n )) dug . </„) 


(138) 


(139) 


These determinants are all functions of the g’s alone.* The last 
is evidently the Hessian or determinant formed of the second dif¬ 
ferential coefficients of the kinetic energy with respect to qi, ... q n . 
We shall denote it by Ag. The reciprocal determinant 

d(qi ■ ■ ■ q n ) 
d(pi ...PnY 

which is the Hessian of the kinetic energy regarded as function of 
the p’s, we shall denote by A p . 

*It will be observed that the proof of (137) depends on the linear rela- 

dlly 

tion between the zt’s and q 1 s, which makes constant with respect to the 

d(jx 

differentiations here considered. Compare note on p. 11. 
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If we set 


_tp 
e e 




e 0 A p dp\ ... dp r 


-u\ - uj 

e 20 dui... du n 


n 

(2jt©) 2 , 


(140) 


and 

Vv = ip - i/j p , (141) 

the fractional part of the ensemble which lies within any given limits 
of configuration (136) may be written 


/ 



A| dq i . . . dq n , 


(142) 


where the constant ij} q may be determined by the condition that the 
integral extended over all configurations has the value unity.* 

*In the simple but important case in which A,} is independent of the g’s, 
and e q a quadratic function of the q’s, if we write e a for the least value of e q (or 
of e) consistent with the given values of the external coordinates, the equation 
determining il> q may be written 

1 r°° r°° (e 9 -^e a ) 

e e = AJ / • • • / e e dq x ... dq n . 

J —GO J —GO 


If we denote by ... q' n the values of q x , ... q n which give e q its least value e a , 
it is evident that e q — e a is a homogeneous quadratic function of the differences 
q x — q[, etc., and that dq x , ... dq n may be regarded as the differentials of these 
differences. The evaluation of this integral is therefore analytically similar to 
that of the integral 

r-oo r-oo 

/ ••• / e 0 dpi...dp n , 


' —OO «/ —OO 


for which we have found the value A p 2 (27T0) 2 . By the same method, or by 
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analogy, we get 



where A q is the Hessian of the potential energy as function of the q' s. It will be 


observed that A q depends on the forces of the system and is independent of the 
masses, while A q or its reciprocal A p depends on the masses and is indepen¬ 
dent of the forces. While each Hessian depends on the system of coordinates 
employed, the ratio A q /A q is the same for all systems. 

Multiplying the last equation by (140), we have 



For the average value of the potential energy, we have 



The evaluation of this expression is similar to that of 



which expresses the average value of the kinetic energy, and for which we have 
found the value ^n&. We have accordingly 



Adding the equation 


6p — 2 ^® 5 


we have 


e — e a = 710. 
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When an ensemble of systems is distributed in configuration in 
the manner indicated in this formula, i.e., when its distribution in 
configuration is the same as that of an ensemble canonically dis¬ 
tributed in phase, we shall say, without any reference to its veloci¬ 
ties, that it is canonically distributed in configuration. 

For any given configuration, the fractional part of the systems 
which lie within any given limits of velocity is represented by the 
quotient of the multiple integral 

e © dpi... dp n , 



or its equivalent 



1 

A? d/u i ... du n , 


taken within those limits divided by the value of the same integral 
for the limits ±oo. But the value, of the second multiple integral 
for the limits ±oo is evidently 

l n 

A? (2*6) 5. 

We may therefore write 


or 


or again 



4>p-e p 


e © d/u i ... du n , 

(143) 



© A p 2 dpi ... dp n , 

(144) 

p .1 


© A? dq i . . . dq n , 

(145) 
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for the fractional part of the systems of any given configuration 
which lie within given limits of velocity. 

When systems are distributed in velocity according to these for¬ 
mulae, i.e., when the distribution in velocity is like that in an en¬ 
semble which is canonically distributed in phase, we shall say that 
they are canonically distributed in velocity. 

The fractional part of the whole ensemble which falls within any 
given limits of phase, which we have before expressed in the form 

0—e 

e © dpi ... dp n dqi... dq n , (146) 

may also be expressed in the form 



0—e 


e 0 Ag dqi. .. dq n dqi . . . dq n 


(147) 



VI. 


EXTENSION IN CONFIGURATION AND EXTENSION IN 

VELOCITY. 

The formulae relating to canonical ensembles in the closing para¬ 
graphs of the last chapter suggest certain general notions and prin¬ 
ciples, which we shall consider in this chapter, and which are not at 
all limited in their application to the canonical law of distribution.* 
We have seen in Chapter IV. that the nature of the distribu¬ 
tion which we have called canonical is independent of the system of 
coordinates by which it is described, being determined entirely by 
the modulus. It follows that the value represented by the multiple 
integral (142), which is the fractional part of the ensemble which lies 
within certain limiting configurations, is independent of the system 
of coordinates, being determined entirely by the limiting configura¬ 
tions with the modulus. Now -0, as we have already seen, represents 
a value which is independent of the system of coordinates by which 
it is defined. The same is evidently true of i/) p by equation (140), 
and therefore, by (141), of ij} q . Hence the exponential factor in the 
multiple integral (142) represents a value which is independent of 
the system of coordinates. It follows that the value of a multiple 
integral of the form 


j " ■ -J A? dq x ... dq n (148) 

* These notions and principles are in fact such as a more logical arrange¬ 
ment of the subject would place in connection with those of Chapter I., to 
which they are closely related. The strict requirements of logical order have 
been sacrificed to the natural development of the subject, and very elementary 
notions have been left until they have presented themselves in the study of the 
leading problems. 
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is independent of the system of coordinates which is employed for 
its evaluation, as will appear at once, if we suppose the multiple 
integral to be broken up into parts so small that the exponential 
factor may be regarded as constant in each. 

In the same way the formulae (144) and (145) which express 
the probability that a system (in a canonical ensemble) of given 
configuration will fall within certain limits of velocity, show that 
multiple integrals of the form 


J • • J A p 2 dp\ ... dp n (149) 

or 

j • ■ j A? dqi . . . dq n (150) 

relating to velocities possible for a given configuration, when the 
limits are formed by given velocities, have values independent of 
the system of coordinates employed. 

These relations may easily be verified directly. It has already 
been proved that 

djPu ■ ■ ■ Pn) _ d(gi, ...q n ) _ d(qi, ...q n ) 
d(pi, • • • Pn) d(Qi, . . . Qn) d(Q i, . . . Qn) 


where qi, ■ ■ ■ q n , Pi, ■ ■ ■ Pn and Q i, ... Q n , P±, ... P n are two systems 
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of coordinates and momenta.* It follows that 


uypi , . . 

d(qi,.. 

■Pn) 1 
Qn)) 

'd(p i,. 

■ Pn ) 

v d(qi, ■ 

• Qn) 

( d(p i,. 

■Pn) 

v d(gi,. 

■Qn) 


f d{P u ...P n ) 

\d(Qi,... Qn) 


dqi...dq n 

1 

2 d(qi, ...q n ) 
d(Ql , ■ ■ ■ Qn) 


dQ i... dQ ji 


f d{P \, ■ 

■■ Pn) Y\ 

( d(qi,. 

■■Qn) \ 

\d(pi,. 

■■Pn) ) 

\d(Qi,. 

■■Qn)) 


dQ i... dQ ^ 


dQ i ■ • ■ dQ yj 


and 



/ d(Qi,.. .Q n )\ 
{d(P U ...P n ) ) 


dPi... dP n 


f f ( d(Qi,...Q n ) \ 
J'"J \d(Pi,...P n ) ) 


f [ ( d(Q u ...Q„) \ 

J 'J \ d(P u ...P n ) ) 


d(Pi,.. 

d(pi,■■ 

.Pn), 

dp i. 

■Pn) 

■ ■ dp n 


(d(Pi, 

■■■Pn) y 

/ d(q lt . 

■■Qn) \ 

V d(pi, 

■■■Pn) ) 

\d(Qi,. 

■■Qn)) 


dpi... dp n 



f d{qI,...q n ) \ 
\d(pi,...p n )J 


dpi... dp n 


The multiple integral 


dpi... dp n dqi. . . dq n , 


which may also be written 

■ A gdq!.. .dq n dqi.. .dq n , 


(151) 


(152) 


See equation (29). 
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and which, when taken within any given limits of phase, has been 
shown to have a value independent of the coordinates employed, ex¬ 
presses what we have called an extension-in-phase* In like manner 
we may say that the multiple integral (148) expresses an extension- 
in-configuration, and that the multiple integrals (149) and (150) 
express an extension-in-velocity. We have called 

dpi... dp n dqi... dq n , (153) 

which is equivalent to 

Ag dtp ... dq n dq 1 ... dq n , (154) 

an element of extension-in-phase. We may call 

A? dqi... dq n (155) 

an element of extension-in-configuration, and 

Ajl d;p\ ... dp n , (156) 

or its equivalent 

A? dqi... dq n , (157) 

an element of extension-in-velocity. 

An extension-in-phase may always be regarded as an integral 
of elementary extensions-in-conhguration multiplied each by an 
extension-in-velocity. This is evident from the formulae (151) and 
(152) which express an extension-in-phase, if we imagine the inte¬ 
grations relative to velocity to be first carried out. 


See Chapter I, p. 9. 
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The product of the two expressions for an element of extension- 
in-velocity (149) and (150) is evidently of the same dimensions as 
the product 

Pi ... p n q i ...q n , 

that is, as the nth power of energy, since every product of the 
form p\(p has the dimensions of energy. Therefore an extension- 
in-velocity has the dimensions of the square root of the nth power 
of energy. Again we see by (155) and (156) that the product of an 
extension-in-configuration and an extension-in-velocity have the di¬ 
mensions of the nth power of energy multiplied by the nth power of 
time. Therefore an extension-in-configuration has the dimensions of 
the nth power of time multiplied by the square root of the nth power 
of energy. 

To the notion of extension-in-configuration there attach them¬ 
selves certain other notions analogous to those which have presented 
themselves in connection with the notion of extension-in-phase. The 
number of systems of any ensemble (whether distributed canoni¬ 
cally or in any other manner) which are contained in an element of 
extension-in-configuration, divided by the numerical value of that 
element, may be called the density-in-configuration. That is, if a 
certain configuration is specified by the coordinates q\ , ... q n , and 
the number of systems of which the coordinates fall between the 
limits q± and q\ + dqi, ... q n and q n + dq n is expressed by 

D q A? dq x ... dq n , (158) 

D q will be the density-in-configuration. And if we set 


(159) 
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where N denotes, as usual, the total number of systems in the 
ensemble, the probability that an unspecified system of the ensemble 
will fall within the given limits of configuration, is expressed by 

e Vq A? dq x ... dq n . (160) 

We may call e Vq the coefficient of probability of the configuration, 
and r] q the index of probability of the configuration. 

The fractional part of the whole number of systems which are 
within any given limits of configuration will be expressed by the 
multiple integral 

J.. .J e n« A? d/p ... dq n . (161) 

The value of this integral (taken within any given configurations) 
is therefore independent of the system of coordinates which is used. 
Since the same has been proved of the same integral without the 
factor e Vq , it follows that the values of r) q and D q for a given con¬ 
figuration in a given ensemble are independent of the system of 
coordinates which is used. 

The notion of extension-in-velocity relates to systems having the 
same configuration.* If an ensemble is distributed both in configura¬ 
tion and in velocity, we may confine our attention to those systems 

*Except in some simple cases, such as a system of material points, we cannot 
compare velocities in one configuration with velocities in another, and speak of 
their identity or difference except in a sense entirely artificial. We may indeed 
say that we call the velocities in one configuration the same as those in another 
when the quantities q-\ , ... q n have the same values in the two cases. But this 
signifies nothing until the system of coordinates has been defined. We might 
identify the velocities in the two cases which make the quantities p \, ... p n the 
same in each. This again would signify nothing independently of the system of 
coordinates employed. 
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which are contained within certain infinitesimal limits of configu¬ 
ration, and compare the whole number of such systems with those 
which are also contained within certain infinitesimal limits of veloc¬ 
ity. The second of these numbers divided by the first expresses the 
probability that a system which is only specified as falling within 
the infinitesimal limits of configuration shall also fall within the in¬ 
finitesimal limits of velocity. If the limits with respect to velocity are 
expressed by the condition that the momenta shall fall between the 
limits pi and p\ + dpi, ...p n and p n + dp n the extension-in-velocity 
within those limits will be 

1 

Ap dp\ ... dp n , 

and we may express the probability in question by 

e Vp Ap dpi... dp n . (162) 

This may be regarded as defining q p . 

The probability that a system which is only specified as having a 
configuration within certain infinitesimal limits shall also fall within 
any given limits of velocity will be expressed by the multiple integral 

e Vp Ap dpi... dp n , (163) 

or its equivalent 

J • • J e Vp A? dqi... dq n , (164) 

taken within the given limits. 

It follows that the probability that the system will fall within 
the limits of velocity, q\ and qi + dqi, ... q n and q n + dq n is expressed 
by 

e Vp A? dqi... dq n . 



(165) 
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The value of the integrals (163), (164) is independent of the 
system of coordinates and momenta which is used, as is also the 
value of the same integrals without the factor e Vp \ therefore the 
value of rj p must be independent of the system of coordinates and 
momenta. We may call e Vp the coefficient of probability of velocity , 
and the index of probability of velocity. 

Comparing (160) and (162) with (40), we get 

e nq e np = P = e ri (166) 

or 

Vg + V P = V- (167) 

That is: the product of the coefficients of probability of configu¬ 
ration and of velocity is equal to the coefficient of probability of 
phase; the sum of the indices of probability of configuration and of 
velocity is equal to the index of probability of phase. 

It is evident that e nq and e Vp have the dimensions of the recipro¬ 
cals of extension-in-configuration and extension-in-velocity respec- 

n n 

tively, i.e., the dimensions of t~ n e~ 2 and e _ 2 ; where t represents 
any time, and e any energy. If, therefore, the unit of time is multi¬ 
plied by c t , and the unit of energy by c e , every r] q will be increased 
by the addition of 

n log c t + \n log c e , (168) 

and every r] p by the addition of 

\n\ogc e .* (169) 


Compare (47) in Chapter I. 
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It should be observed that the quantities which have been called 
extension-in-configuration and extension-in-velocity are not, as the 
terms might seem to imply, purely geometrical or kinematical con¬ 
ceptions. To express their nature more fully, they might appropri¬ 
ately have been called, respectively, the dynamical measure of the 
extension in configuration , and the dynamical measure of the exten¬ 
sion in velocity. They depend upon the masses, although not upon 
the forces of the system. In the simple case of material points, where 
each point is limited to a given space, the extension-in-configuration 
is the product of the volumes within which the several points are 
confined (these may be the same or different), multiplied by the 
square root of the cube of the product of the masses of the several 
points. The extension-in-velocity for such systems is most easily de¬ 
fined as the extension-in-configuration of systems which have moved 
from the same configuration for the unit of time with the given ve¬ 
locities. 

In the general case, the notions of extension-in-configuration and 
extension-in-velocity may be connected as follows. 

If an ensemble of similar systems of n degrees of freedom have the 
same configuration at a given instant, but are distributed through¬ 
out any finite extension-in-velocity, the same ensemble after an in¬ 
finitesimal interval of time St will be distributed throughout an ex¬ 
tension in configuration equal to its original extension-in-velocity 
multiplied by 5t n . 

In demonstrating this theorem, we shall write q \, ... q' n for the 
initial values of the coordinates. The final values will evidently be 
connected with the initial by the equations 

Q\ - q[ = <ii8t, . . . , q n — q' n = q n St. (170) 

Now the original extension-in-velocity is by definition represented 
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by the integral 

J-J A| dqi... dq n , (171) 

where the limits may be expressed by an equation of the form 

F(q 1 ,...q n )= 0. (172) 

The same integral multiplied by the constant St n may be written 

J • • J A? d(ch St)... d(q n St), (173) 

and the limits may be written 

F(qi,. . . q n ) = f(q, St,... q n St) = 0. (174) 

(It will be observed that St as well as Ag is constant in the integra¬ 
tions.) Now this integral is identically equal to 

J ' ' J A | d(q 1 - q[) . . . d(q n - q' n ), (175) 

or its equivalent 

J- ■ ■ J A? dq x ... dq n , (176) 

with limits expressed by the equation 

f{qi^q[,...qn-q' n ) = 0- (177) 

But the systems which initially had velocities satisfying the equa¬ 
tion (172) will after the interval St have configurations satisfying 
equation (177). Therefore the extension-in-configuration repre¬ 
sented by the last integral is that which belongs to the systems 
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which originally had the extension-in-velocity represented by the 
integral (171). 

Since the quantities which we have called extensions-in-phase, 
extensions-in-configuration, and extensions-in-velocity are indepen¬ 
dent of the nature of the system of coordinates used in their defi¬ 
nitions, it is natural to seek definitions which shall be independent 
of the use of any coordinates. It will be sufficient to give the fol¬ 
lowing definitions without formal proof of their equivalence with 
those given above, since they are less convenient for use than those 
founded on systems of coordinates, and since we shall in fact have 
no occasion to use them. 

We commence with the definition of extension-in-velocity. We 
may imagine n independent velocities, Vi , ... V n of which a system 
in a given configuration is capable. We may conceive of the system 
as having a certain velocity Vo combined with a part of each of 
these velocities V\, ... V n . By a part of V\ is meant a velocity of 
the same nature as V\ but in amount being anything between zero 
and V\. Now all the velocities which may be thus described may 
be regarded as forming or lying in a certain extension of which we 
desire a measure. The case is greatly simplified if we suppose that 
certain relations exist between the velocities V \, ... V n viz.: that 
the kinetic energy due to any two of these velocities combined is 
the sum of the kinetic energies due to the velocities separately. In 
this case the extension-in-motion is the square root of the product of 
the doubled kinetic energies due to the n velocities Vf, .. .V n taken 
separately. 

The more general case may be reduced to this simpler case as 
follows. The velocity V 2 may always be regarded as composed of two 
velocities V.[ and Vff, of which V.[ is of the same nature as V±, (it may 
be more or less in amount, or opposite in sign,) while V" satisfies the 
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relation that the kinetic energy cine to V\ and V" combined is the 
sum of the kinetic energies due to these velocities taken separately. 
And the velocity V 3 may be regarded as compounded of three, 

I 3 ", V"', of which VI is of the same nature as V±, V” of the same 
nature as V^, while V"' satisfies the relations that if combined either 
with V or V" the kinetic energy of the combined velocities is the 
sum of the kinetic energies of the velocities taken separately. When 
all the velocities V 2 , ... V n have been thus decomposed, the square 
root of the product of the doubled kinetic energies of the several 
velocities Vi, V". V-’” , etc., will be the value of the extension-in- 
vclocity which is sought. 

This method of evaluation of the extension-in-velocity which we 
are considering is perhaps the most simple and natural, but the re¬ 
sult may be expressed in a more symmetrical form. Let us write 612 
for the kinetic energy of the velocities V\ and V 2 combined, dimin¬ 
ished by the sum of the kinetic energies due to the same velocities 
taken separately. This may be called the mutual energy of the 
velocities V\ and V 2 . Let the mutual energy of every pair of the 
velocities V\, ... W be expressed in the same way. Analogy would 
make en represent the energy of twice V\ diminished by twice the 
energy of V , i.e., en would represent twice the energy of Vi, al¬ 
though the term mutual energy is hardly appropriate to this case. 
At all events, let en have this signification, and 622 represent twice 
the energy of V 2 , etc. The square root of the determinant 


£ll 

£12 ' ' 

tin 

£21 

£22 ' ■ 

t 2 n 

1 

£n 2 ‘ ' 

t-nn 


represents the value of the extension-in-velocity determined as 
above described by the velocities V\ , ... V n . 
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The statements of the preceding paragraph may be readily 
proved from the expression (157) on page 68, viz., 

1 

A? dq { . . . dq n 


by which the notion of an element of extension-in-velocity was 
originally defined. Since A q in this expression represents the deter¬ 
minant of which the general element is 

d 2 e 

dqi dqj 

the square of the preceding expression represents the determinant 
of which the general element is 


d 2 e 

dqi dqj 


dqi dqj. 


Now we may regard the differentials of velocity dqi, dqj as them¬ 
selves infinitesimal velocities. Then the last expression represents 
the mutual energy of these velocities, and 


d 2 e 



represents twice the energy due to the velocity dqi. 

The case which we have considered is an extension-in-velocity of 
the simplest form. All extensions-in-velocity do not have this form, 
but all may be regarded as composed of elementary extensions of 
this form, in the same manner as all volumes may be regarded as 
composed of elementary parallelepipeds. 

Having thus a measure of extension-in-velocity founded, it will 
be observed, on the dynamical notion of kinetic energy, and not 
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involving an explicit mention of coordinates, we may derive from it 
a measure of extension-in-configuration by the principle connecting 
these quantities which has been given in a preceding paragraph of 
this chapter. 

The measure of extension-in-phase may be obtained from that of 
extension-in-configuration and of extension-in-velocity. For to every 
configuration in an extension-in-phase there will belong a certain 
extension-in-velocity, and the integral of the elements of extension- 
in-configuration within any extension-in-phase multiplied each by 
its extension-in-velocity is the measure of the extension-in-phase. 



VII. 


FARTHER DISCUSSION OF AVERAGES IN A CANONICAL 
ENSEMBLE OF SYSTEMS. 


Returning to the case of a canonical distribution, we have for the 
index of probability of configuration 


Vg 


V? ~ Q 

0 


(178) 


as appears on comparison of formulae (142) and (161). It follows 
immediately from (142) that the average value in the ensemble of 
any quantity u which depends on the configuration alone is given 
by the formula 


u = 


/ all 

config. ( 


j’q-tq I 

ue 0 AJ dqi .. .dq n , 


(179) 


where the integrations cover all possible configurations. The value 
of ijjq is evidently determined by the equation 


_ta_ 

e © = 


/ all 
config. 


‘ _££ 1 

e ©A ? dqi... dq n 


(180) 


By differentiating the last equation we may obtain results anal¬ 
ogous to those obtained in Chapter IV. from the equation 


± 

e e = 


/ all 

phases 


e © dpi.. . dq n . 

As the process is identical, it is sufficient to give the results: 
di/jq = fj q dQ — Ai dai — A 2 da 2 — etc., 


(181) 
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or, since 

and 


de q = 


^q = e q + 0m, 


dijj q = de q + rj q dO + 0 drj q , 

-0 drj Q — Ai dai — A 2 da 2 — etc. 


(182) 

(183) 

(184) 


It appears from this equation that the differential relations subsist¬ 
ing between the average potential energy in an ensemble of systems 
canonically distributed, the modulus of distribution, the average in¬ 
dex of probability of configuration, taken negatively, and the average 
forces exerted on external bodies, are equivalent to those enunciated 
by Clausius for the potential energy of a body, its temperature, a 
quantity which he called the disgregation, and the forces exerted on 
external bodies.* 

For the index of probability of velocity, in the case of canonical 
distribution, we have by comparison of (144) and (163), or of (145) 
and (164), 


Vp 


which gives 


we have also 


and by (140), 


V P = 


0 


^ P ~ e 


p. 


0 


e p = \n&, 

V'p — — !tt01og(27T0). 


(185) 

(186) 

(187) 

(188) 


■Pogg. Ann. Bd. CXVI, S. 73, (1862); ibid., Bd. CXXV, S. 353, (1865), 
See also Boltzmann, Sitzb. der Wiener Akad., Bd. LXIII, S. 728, (1871). 
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From these equations we get by differentiation 


di> p = rj p dO, 

(189) 

and 


dip = —0 drj p . 

(190) 


The differential relation expressed in this equation between the av¬ 
erage kinetic energy, the modulus, and the average index of prob¬ 
ability of velocity, taken negatively, is identical with that given by 
Clausius locis citatis for the kinetic energy of a body, the temper¬ 
ature, and a quantity which he called the transformation-value of 
the kinetic energy.* The relations 

e = I q + e p , fj = rjq + r] p 

are also identical with those given by Clausius for the corresponding 
quantities. 

Equations (112) and (181) show that if -0 or i\) q is known as 
function of 0 and cq, a 2 , etc., we can obtain by differentiation 
I or I q , and A\, A 2 , etc. as functions of the same variables. We have 
in fact 

e = 0-0rJ = 0-0^, (191) 

e q = i\) q - Qfj q = 0 9 - 0 (192) 

The corresponding equation relating to kinetic energy, 

e P = 0p-0r? p = 0p-0^, (193) 


Verwandlungswerth des Warmeinhaltes. 
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which may be obtained in the same way, may be verified by the 
known relations (186), (187), and (188) between the variables. We 
have also 

A - ^ 
dci\ d(l\ 

etc., so that the average values of the external forces may be derived 
alike from 0 or from 0 g . 

The average values of the squares or higher powers of the energies 
(total, potential, or kinetic) may easily be obtained by repeated 
differentiations of 0 , 0 9 , ijjp, or e, e q , e p , with respect to 0. By 
equation (108) we have 



all 


'ip—e 

e=/--- / ee~&~ dpi... dq n , 

J phases J 


and differentiating with respect to 0, 
de f ail f f e 2 — -0e e dxfj \ 


dG 


/ all 
phases 


e 2 + § d» |e 9 dp '-" 


whence, again by (108), 

de 

dG 


-2 


©2 


ibe e dib 

+ e de’ 


or 


de 


e 2 = G~ + 6-0 — 0 — . 


dG 


d0 


dG 


Combining this with (191), 




(195) 


(196) 


(197) 


( 198 ) 
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In precisely the same way, from the equation 


e 


9 


/ all 

config. 


4>q-£q 1 

e q e & A? dq 1 . . . dq ni 


we may obtain 


-^ = f + Q* d k 
q q+ dG 



(-)3 ^ 
dG) dG 2 ' 


(199) 


( 200 ) 


In the same way also, if we confine ourselves to a particular 
configuration, from the equation 


e p ~ 


/ all 
veloc. 


^p e p l 


e p e e Ap dpi... dp n) 


( 201 ) 


we obtain 


-? = ?? + e 2 $MA-e 


dG 


dijjp 

dG 


n.3 dSpp 

dG 2 ' 


( 202 ) 


which by (187) reduces to 


4=(y + y &2 


(203) 


Since this value is independent of the configuration, we see that the 
average square of the kinetic energy for every configuration is the 
same, and therefore the same as for the whole ensemble. Hence 
e 2 may be interpreted as the average either for any particular con¬ 
figuration, or for the whole ensemble. It will be observed that the 
value of this quantity is determined entirely by the modulus and the 
number of degrees of freedom of the system, and is in other respects 
independent of the nature of the system. 
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Of especial importance are the anomalies of the energies, or their 
deviations from their average values. The average value of these 
anomalies is of course zero. The natural measure of such anomalies 
is the square root of their average square. Now 

(e - e) 2 = e 2 - e 2 , (204) 


identically. Accordingly 




d2 ^ 

de 2 ' 


In like manner, 


(205) 


de 


(e g -e,) 2 = 0 2 ^ = -0 3 


de 


d 2 ^ q 

d0^’ 


) dfr 


d 2 i>p 


( e -e) 2 = @ 2 —^ = -0 3 —lp = 4n0 2 

1 p £pJ u dO dO 2 n 


(206) 

(207) 


Hence 

(e - e) 2 = (e q - e q ) 2 + (e p - e p ) 2 . (208) 

Equation (206) shows that the value of de q /de can never be nega¬ 
tive, and that the value of d 2 ^ g /d0 2 or drj q /de can never be posi¬ 
tive.* 

To get an idea of the order of magnitude of these quantities, we 
may use the average kinetic energy as a term of comparison, this 

*In the case discussed in the note on page 61, in which the potential energy 
is a quadratic function of the q' s, and A,j independent of the g’s, we should get 
for the potential energy 

(e q -e q ) 2 = in© 2 , 


and for the total energy 


(e — e) 2 = n,Q 2 . 
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quantity being independent of the arbitrary constant involved in 
the definition of the potential energy. Since 

e p = |ra©, 



2 

5 

n 

2 de q 
n dl p 

2 dl 2 2 dl q 
n de p n n dl p 


( 209 ) 

( 210 ) 
( 211 ) 


These equations show that when the number of degrees of free¬ 
dom of the systems is very great, the mean squares of the anomalies 
of the energies (total, potential, and kinetic) are very small in com¬ 
parison with the mean square of the kinetic energy, unless indeed 
the differential coefficient de q /de p is of the same order of magnitude 
as n. Such values of dl q /de p can only occur within intervals (l p — l' p ) 
which are of the order of magnitude of n _1 , unless it be in cases in 
which lg is in general of an order of magnitude higher than e p . 
Postponing for the moment the consideration of such cases, it will 

We may also write in this case, 


- Cj ) 2 _ 2 
(dq - £a ) 2 


(£-e ) 2 


1 


n 


(e- £a ) 2 


n 
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be interesting to examine more closely the case of large values of 
de q /de p within narrow limits. Let us suppose that for e' p and e" the 
value of de q /de p is of the order of magnitude of unity, but between 
these values of e p very great values of the differential coefficient oc¬ 
cur. Then in the ensemble having modulus 0 ,/ and average energies 
e" and e", values of e q sensibly greater than e q will be so rare that 
we may call them practically negligible. They will be still more rare 
in an ensemble of less modulus. For if we differentiate the equation 


Vg 


*l>q ~ £q 

0 


regarding e q as constant, but 0 and therefore ijj q as variable, we get 


d7j q \ _ 1 d'lpq ij) q - e q 

del - 0 ~de © 2 

' e q 

whence by (192) 

d V q \ _ e q ~e q 

de) ~ 0 2 ' 

That is, a diminution of the modulus will diminish the probability of 
all configurations for which the potential energy exceeds its average 
value in the ensemble. Again, in the ensemble having modulus 0 7 
and average energies e' p and e' q , values of e q sensibly less than e' q 
will be so rare as to be practically negligible. They will be still 
more rare in an ensemble of greater modulus, since by the same 
equation an increase of the modulus will diminish the probability of 
configurations for which the potential energy is less than its average 
value in the ensemble. Therefore, for values of 0 between 0 ; and Q", 
and of e p between e p and e", the individual values of e q will be 
practically limited to the interval between e' q and e q . 


( 212 ) 

(213) 






STATISTICAL MECHANICS 


87 


In the cases which remain to be considered, viz., when de q /de p 
has very large values not confined to narrow limits, and consequently 
the differences of the mean potential energies in ensembles of differ¬ 
ent moduli are in general very large compared with the differences 
of the mean kinetic energies, it appears by (210) that the anomalies 
of mean square of potential energy, if not small in comparison with 
the mean kinetic energy, will yet in general be very small in compar¬ 
ison with differences of mean potential energy in ensembles having 
moderate differences of mean kinetic energy,—the exceptions being 
of the same character as described for the case when de q /de p is not 
in general large. 

It follows that to human experience and observation with respect 
to such an ensemble as we are considering, or with respect to systems 
which may be regarded as taken at random from such an ensemble, 
when the number of degrees of freedom is of such order of magnitude 
as the number of molecules in the bodies subject to our observation 
and experiment, e — e, e p — e p , e q — e q would be in general vanish¬ 
ing quantities, since such experience would not be wide enough to 
embrace the more considerable divergencies from the mean values, 
and such observation not nice enough to distinguish the ordinary 
divergencies. In other words, such ensembles would appear to hu¬ 
man observation as ensembles of systems of uniform energy, and in 
which the potential and kinetic energies (supposing that there were 
means of measuring these quantities separately) had each separately 
uniform values.* Exceptions might occur when for particular values 
of the modulus the differential coefficient de q /de p takes a very large 
value. To human observation the effect would be, that in ensembles 
in which 0 and e p had certain critical values, e q would be indetermi- 

*This implies that the kinetic and potential energies of individual systems 
would each separately have values sensibly constant in time. 
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nate within certain limits, viz., the values which would correspond 
to values of 0 and e p slightly less and slightly greater than the crit¬ 
ical values. Such indeterminateness corresponds precisely to what 
we observe in experiments on the bodies which nature presents to 
us.* 

To obtain general formulae for the average values of powers of 
the energies, we may proceed as follows. If h is any positive whole 
number, we have identically 

dq n , 

(214) 


(215) 


(216) 


(217) 


*As an example, we may take a system consisting of a fluid in a cylinder 
under a weighted piston, with a vacuum between the piston and the top of 
the cylinder, which is closed. The weighted piston is to be regarded as a part 
of the system. (This is formally necessary in order to satisfy the condition of 
the invariability of the external coordinates.) It is evident that at a certain 
temperature, viz., when the pressure of saturated vapor balances the weight of 
the piston, there is an indeterminateness in the values of the potential and total 
energies as functions of the temperature. 


all 


e h e © dpi... dq n = @ 2 — / • / e n 1 e © dpi... 


d 


all 


r-h 1 ^ o 


J phases 

i.e., by (108), 

Hence 

and 


dO I phases 


— b 


d 


e h e Q = 0 2 — ( e M e © 1 . 


-r ^ d V 

e e e = ( e beJ e e ' 


-r t d \ k _± 

= ^ < e ae e e - 
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For h — 1, this gives 


,2 d ( ^ 


* = "^59 §!• 


which agrees with (191). 
From (215) we have also 


( 218 ) 


>< ie h ~ l 


= 16“-' +e> 2 — = (e + tf—) e 1 - 1 , 


d 


d 


e '‘ =,?+0 d0 l £ ' 


h—1 


(219) 

( 220 ) 


In like manner from the identical equation 


/ all 
config. 


€q 1 

e h Q e~ e A 2 dq 1 ... dq n 


d 


all 


h -1 -7T 1 


= 02 ^ /Ag. / <-‘e-eA|(221) 


we get 


and 


-j- ©. / 2 d 

e h = e © 0 2 — 

9 V d© 


<i = I <r. + 9 2 A 


i>q 

e ©, 


d© 


ft .—i 


©• 


( 222 ) 


(223) 


With respect to the kinetic energy similar equations will hold 
for averages taken for any particular configuration, or for the whole 
ensemble. But since 

- _ n n 

€ p ~ 2®’ 
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the equation 


reduces to 


d 


fh — I £ i - | fh—l 

p \ p+ do' p 


(224) 


—r (n d \ —r—r n (n^ d 

e h = -0 + @ 2 — e h_1 = - -0 + 0 — 

p \2 dQ p 2 V 2 de 


h -1 


0. (225) 


We have therefore 


—77 /n 


eh = 
p 



(226) 

+ 2 )(i + i)^. 

(227) 

r(|n + h) , _ 
r(|n) 

(228) 


*In the case discussed in the note on page 61 we may easily get 


( 6 q ~ e ot) h — [ e q — e a + Q — ) (e q — e a ) 


h -1 


which, with 


gives 


e q e a — 2 


h -1 


<- = I i e + 02 ^)< e «“ o- - 5 Gfe + e> A , e. 


Hence 

Again 


(e 9 - ea)^ = e£. 


(e - e a ) h = [e-e a + 0 2 (e - e^" 1 , 
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The average values of the powers of the anomalies of the energies 
are perhaps most easily found as follows. We have identically, since 
e is a function of 0, while e is a function of the p’s and g’s, 


©"— 


d f all 


d© J phases > 
f all 
phases 


(e - e) e © dpi ... dq n 


e © dpi... dq n , 
(229) 


he., by (108), 


d 


e 2 — 


d© 

or since by (218) 

d 


e — e) h e © 


e(e — e) h — h(e — e)^ -1 © 2 -^- 


_±_ 

e ©, 


(230) 


o d _± _± 

0 3e e ~ e= * e ~ e ' 


© 2 ^(e ~ ^ + ( e _ e ) h£ = 6 ( e “ e ) h “ h ( 6 ~ 6 ) /l 1@2 ^’ 


d 


de 


(e _ e)^i = 0 2 — (e - e)* + /r(e - e)^© 2 —. (231) 


which with 


e — e Q = n© 


gives 


d© 


(e - e a ) h = n© + 0 2 — (e - e a)^ 1 = n n© + 0 2 — © 


d© 


h-1 




F{n + h) h 

r(n) 


hence 
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In precisely the same way we may obtain for the potential energy 

(I - (232) 

By successive applications of (231) we obtain 

V 1 W=De, 

= D\ 

(e — e) 4 = D 3 e + 3(De) 2 , 

(e - e) 5 = D A I + 10DID 2 !, 

(e - e) 6 = D 5 ! + 15DlD 3 I + 10(£> 2 e) 2 + 15( J De) 3 etc., 

where D represents the operator Q 2 d/dQ. Similar expressions re¬ 
lating to the potential energy may be derived from (232). 

For the kinetic energy we may write similar equations in which 
the averages may be taken either for a single configuration or for 
the whole ensemble. But since 

dl p n 
dG = 2’ 

the general formula reduces to 

(e p - I p ) h+1 = 0 2 ^(e p - t P ) h + \nh<d 2 (e p - I p ) h ~ l , (233) 

or 



20 d_ (e p - I p ) h 2h (e p - I p ) h 2h (e p - I p ) h ~ l 

n d<3 Ip n Ip n e^ -1 

(234) 
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But since identically 



the value of the corresponding expression for any index will be in¬ 
dependent of 0 and the formula reduces to 



(235) 


It will be observed that when if) or e is given as function of 0, 
all averages of the form e h or (e — e) h are thereby determined. So 
also if -0 q or e q is given as a function of 0, all averages of the form 
e q or ( e q — e q ) h are determined. But 

e q = e — yn0. 

*In the case discussed in the preceding foot-notes we get easily 

~e q ) h = (tp-£ P ) h , 

and _ _ 

h / — \ h 

-M- 
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Therefore if any one of the quantities ij), i() q , e, e q is known as a 
function of 0, and n is also known, all averages of any of the forms 
mentioned are thereby determined as functions of the same variable. 
In any case all averages of the form 


h^) h 

are known in terms of n alone, and have the same value whether 
taken for the whole ensemble or limited to any particular configu¬ 
ration. 

If we differentiate the equation 


/ all 
phases 


'ip—e 

e © dpi... dq n = 1 


with respect to a \, and multiply by 0, we have 


/ 


/ 


dip 

da i 


de 

da\ 


'ip—e 

e © dpi... dq n = 0. 


(236) 


(237) 


Differentiating again, with respect to a±, with respect to a 2 , and 


For the total energy we have in this case 
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with respect to 0, we have 


d 2 p> d 2 e 1 / d/ip de 
da\ daf © \dai da i 


d 2 p} 


d 2 ( 


1 

+ 77 


dtp 


ip—e 

e © dpi... dq n = 0, 

(238) 

de \ (dp) de 


da\da 2 da\da 2 0 \^ a i da\) \da 2 da 2 / 

e 0 dpi... dq n = 0, (239) 


d 2 p} (dp} de 

daidQ \dai da\ 


1 dp} p} — e 

QdQ~ 0 2 

e © dpi ... dq n = 0. (240) 


The multiple integrals in the last four equations represent the aver¬ 
age values of the expressions in the brackets, which we may therefore 
set equal to zero. The first gives 


dp} de — 
da\ da\ 1 


(241) 


as already obtained. With this relation and (191) we get from the 
other equations 


{ A 1 -A 1 y = e[pp- d (t)=e^ dA ' dA 1 


dal dal 


da\ dai 


(242) 


[A\ — A\){A2 — A 2 ) — 0 


d 2 e 


d 2 p> 


= 0 


da\da2 da\da2 t 
dA\ dAi \ (dA2 


dao dai 


= 0 


dAi 


da\ da± 


(243) 
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(Ai — Ai)(e — e) — — 0 


2 d ^ 2 dA\ 2 d V 


= 0 


dQ da i 


da\ d@ 

We may add for comparison equation (205), which might be derived 
from (236) by differentiating twice with respect to 0: 


, d 2 \j) 


(e — e) 2 = -0 2 ^A = 0 2 ^ 


d& 2 


de 
dQ ' 


The two last equations give 


dA 


(Ai - Ai)(e - e) — — (e - e) 2 . 


(244) 


(245) 


If if; or e is known as function of 0, ai, 02 , etc., (e — e) 2 
may be obtained by differentiation as function of the same vari¬ 
ables. And if -0, or A, or fj is known as function of 0, a±, 
etc., (Ai — A\){e — e) may be obtained by differentiation. But 
(Ai — Ai) 2 and (A\ — Ai)(A 2 — A 2 ) cannot be obtained in any 
similar manner. We have seen that (e — e) 2 is in general a van¬ 
ishing quantity for very great values of n, which we may regard 
as contained implicitly in 0 as a divisor. The same is true of 
(Ai — Ai)(e — e). ft does not appear that we can assert the same 
of (Ax — Ai) 2 or (Ai — A x )(A 2 — A 2 ), since d 2 e/da\ may be very 
great. The quantities d 2 e/da\ and d 2r ^/da\ belong to the class 
called elasticities. The former expression represents an elasticity 
measured under the condition that while a 1 is varied the internal 
coordinates qi, ... q n all remain fixed. The latter is an elasticity 
measured under the condition that when a\ is varied the ensemble 
remains canonically distributed within the same modulus. This cor¬ 
responds to an elasticity in physics measured under the condition 
of constant temperature. It is evident that the former is greater 
than the latter, and it may be enormously greater. 
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The divergences of the force A± from its average value are due 
in part to the differences of energy in the systems of the ensemble, 
and in part to the differences in the value of the forces which exist 
in systems of the same energy. If we write A\ | e for the average value 
of A\ in systems of the ensemble which have any same energy, it 
will be determined by the equation 


Ai 


€ 



'Ip—6 

e © dpi... dq n 


ip—e 

e © dpi... dq. n 


(246) 


where the limits of integration in both multiple integrals are two 
values of the energy which differ infinitely little, say e and e + de. 

1p — € 

This will make the factor e © constant within the limits of integra¬ 
tion, and it may be cancelled in the numerator and denominator, 
leaving 




dpi • • • dq n 


(247) 


dpi... dq r 


where the integrals as before are to be taken between e and e + de. 
A A is therefore independent of 0, being a function of the energy 
and the external coordinates. 

Now we have identically 


A\ — Ai — ( A 1 — Ai | e ) + ( A\ | e — Ai ), 

where A\ — A\ | £ denotes the excess of the force (tending to in¬ 
crease ai) exerted by any system above the average of such forces 
for systems of the same energy. Accordingly, 


(A, - AO 2 


(^i _ Ai |J 2 +2(A! - A 1 | e )(Ai | e - A 1 )+(A 1 1 6 - Ai) 2 . 
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But the average value of (Ai — Ai | e — Ai) for systems of the 
ensemble which have the same energy is zero, since for such systems 
the second factor is constant. Therefore the average for the whole 
ensemble is zero, and 


(A - A) 2 = (A - A 1 |J 2 + (A, | e - ^) 2 . (248) 

In the same way it may be shown that 

{Ai — Ai)(e — e) = (A]_ | e — Ai)(e — e). (249) 


It is evident that in ensembles in which the anomalies of energy e —e 
may be regarded as insensible the same will be true of the quantities 
represented by A\ | e — A\. 

The properties of quantities of the form A\ | e will be farther 
considered in Chapter X, which will be devoted to ensembles of 
constant energy. 

It may not be without interest to consider some general formulae 
relating to averages in a canonical ensemble, which embrace many 
of the results which have been given in this chapter. 

Let u be any function of the internal and external coordinates 
with the momenta and modulus. We have by definition 


u = 


I all 
phases 


'ip—e 

ue & dp] ... dq n . 


(250) 


If we differentiate with respect to 0, we have 


du 

dQ 



u 

e 2 


($-€) + 


u oh/A 

0 dBj 


Ip—6 

e~e~ dpi ... dq n , 


du du u{ij) — e) u da^ 
dO = dO e 2 + 0 d0‘ 


or 


( 251 ) 
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Setting u — 1 in this equation, we get 

'ijj — e 

dG = 0 ’ 

and substituting this value, we have 


du 

dG 


du 

dG 


ue ue 
02 “ 02 ’ 


or 


9 du du _ _ 7 - =T7 - 

0 ^0 _e ^0 = ue ~ ue = (u-u)(e-e). (252) 

If we differentiate equation (250) with respect to a (which may 
represent any of the external coordinates), and write A for the 

force — —, we get 


da' 

du 

da 


I all 

phases 


or 


du u d'lb u ]tzi , 

— + - — + -A e 0 dp i ... dq n , 
da 0 da G 


du du u di/i uA 
da da 0 da 0 


(253) 


Setting u — 1 in this equation, we get 

da 

Substituting this value, we have 

du du uA uA 
da da 0 0 ’ 


or 


du 


du —- — 


0 —— © — = uA — uA = (u — u){A — A), 
da da 


(254) 

(255) 
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Repeated applications of the principles expressed by equations 
(252) and (255) are perhaps best made in the particular cases. Yet 
we may write (252) in this form 

(e + D)(u — u), (256) 

where D represents the operator @ 2 d/d@. 

Hence 

(e + D) h (u - u) = 0, (257) 

where h is any positive whole number, ft will be observed, that since 
e is not function of 0, (e + D) h may be expanded by the binomial 
theorem. Or, we may write 

(e + D)u = (e + D)u, (258) 

whence 

(e + D) h u = (e + D) h u. (259) 

But the operator (e + D) h , although in some respects more simple 
than the operator without the average sign on the e, cannot be 
expanded by the binomial theorem, since e is a function of 0 with 
the external coordinates. 

So from equation (254) we have 


whence 


and 


(A d\ 
V© + da) 


(u 


u) = 0, 


(260) 




(u — u) = 0; 


(261) 



(262) 
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whence 



The binomial theorem cannot be applied to these operators. 

Again, if we now distinguish, as usual, the several external 
coordinates by suffixes, we may apply successively to the expres¬ 
sion u — u any or all of the operators 

e + 0"——, A\ T 0 ——, A 2 + 0 ——, etc. (264) 

d\y udi d&2 

as many times as we choose, and in any order, the average value 
of the result will be zero. Or, if we apply the same operators to u, 
and finally take the average value, it will be the same as the value 
obtained by writing the sign of average separately as u, and on e, 
Ai, A 2 , etc., in all the operators. 

If u is independent of the momenta, formulae similar to the 
preceding, but having e q in place of e, may be derived from equa¬ 
tion (179). 



VIII. 


ON CERTAIN IMPORTANT FUNCTIONS OF THE ENERGIES 


OF A SYSTEM. 


In order to consider more particularly the distribution of a canonical 
ensemble in energy, and for other purposes, it will be convenient to 
use the following definitions and notations. 

Let us denote by V the extension-in-phase below a certain limit 
of energy which we shall call e. That is, let 



(265) 


the integration being extended (with constant values of the external 


coordinates) over all phases for which the energy is less than the 


limit e. We shall suppose that the value of this integral is not 
infinite, except for an infinite value of the limiting energy. This will 
not exclude any kind of system to which the canonical distribution 
is applicable. For if 



/ 


e 



taken without limits has a finite value,* the less value represented 


by 



taken below a limiting value of e, and with the e before the integral 
sign representing that limiting value, will also be finite. Therefore 

*This is a necessary condition of the canonical distribution. See Chapter IV, 
p. 39. 
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the value of V, which differs only by a constant factor, will also be 
finite, for finite e. It is a function of e and the external coordinates, 
a continuous increasing function of e, which becomes infinite with e, 
and vanishes for the smallest possible value of e, or for e = — oo, if 
the energy may be diminished without limit. 

Let us also set 

0 = l°g^-- (266) 

The extension in phase between any two limits of energy, e' and e", 
will be represented by the integral 



(267) 


And in general, we may substitute de for dpi... dq n in a 2n-fold 
integral, reducing it to a simple integral, whenever the limits can 
be expressed by the energy alone, and the other factor under the 
integral sign is a function of the energy alone, or with quantities 
which are constant in the integration. 

In particular we observe that the probability that the energy 
of an unspecified system of a canonical ensemble lies between the 
limits e' and e" will be represented by the integral* * 



(268) 


and that the average value in the ensemble of any quantity which 
only varies with the energy is given by the equationt 


u = 


1 tl, 
ue © +<p de, 


>v =o 


(269) 


"“Compare equation (93). 

*Compare equation (108). 
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where we may regard the constant ^ as determined by the equation* 

± r =o ° e , 

e~e = / e~® + *de. (270) 

Jv =o 

In regard to the lower limit in these integrals, it will be observed 
that V = 0 is equivalent to the condition that the value of e is the 
least possible. 

In like manner, let us denote by V q the extension-in-configuration 
below a certain limit of potential energy which we may call e q . That 
is, let 

V q = J- ■ J A? dqi... dq n , (271) 

the integration being extended (with constant values of the external 
coordinates) over all configurations for which the potential energy is 
less than e q . V q will be a function of e q with the external coordinates, 
an increasing function of e q , which does not become infinite (in such 
cases as we shall consider 1) for any finite value of e q . It vanishes for 
the least possible value of e q , or for e q = — oo, if e q can be dimin¬ 
ished without limit. It is not always a continuous function of e q . 
In fact, if there is a finite extension-in-conhguration of constant po¬ 
tential energy, the corresponding value of V q will not include that 
extension-in-configuration, but if e q be increased infinitesimally, the 
corresponding value of V q will be increased by that finite extension- 
in-conhguration. 

Let us also set 

0 g = log ^- (272) 

*Compare equation (92). 

+ If V q were infinite for finite values of e q , V would evidently be infinite for 
finite values of e. 
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The extension-in-configuration between any two limits of potential 
energy e' q and e q may be represented by the integral 



(273) 


whenever there is no discontinuity in the value of V q as function 

of e q between or at those limits, that is, whenever there is no finite 

extension-in-configuration of constant potential energy between or 

at the limits. And in general, with the restriction mentioned, we 

1 

may substitute e^ q de q for A| dq±... dq n in an n-fold integral, re¬ 
ducing it to a simple integral, when the limits are expressed by 
the potential energy, and the other factor under the integral sign is 
a function of the potential energy, either alone or with quantities 
which are constant in the integration. 

We may often avoid the inconvenience occasioned by formulae 
becoming illusory on account of discontinuities in the values of V q as 
function of e q by substituting for the given discontinuous function 
a continuous function which is practically equivalent to the given 
function for the purposes of the evaluations desired. It only re¬ 
quires infinitesimal changes of potential energy to destroy the finite 
extensions-in-configuration of constant potential energy which are 
the cause of the difficulty. 

In the case of an ensemble of systems canonically distributed 
in configuration, when V q is, or may be regarded as, a continuous 
function of e q (within the limits considered), the probability that 
the potential energy of an unspecified system lies between the limits 
e' q and e" is given by the integral 
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where ^ may be determined by the condition that the value of the 
integral is unity, when the limits include all possible values of e q . In 
the same case, the average value in the ensemble of any function of 
the potential energy is given by the equation 

r e q=°° 4>-e q 

u= ue e +<t>q de q . (275) 

Jv q =0 

When V q is not a continuous function of e q , we may write dV q for 
e^ q de q in these formulae. 

In like manner also, for any given configuration, let us denote 
by V p the extension-in-velocity below a certain limit of kinetic en¬ 
ergy specified by e p . That is, let 

V p = j- ■ j Ap dpi... dp n , (276) 

the integration being extended, with constant values of the coord¬ 
inates, both internal and external, over all values of the momenta for 
which the kinetic energy is less than the limit e p . V p will evidently be 
a continuous increasing function of e p which vanishes and becomes 
infinite with e p . Let us set 


0p = 1g g^J. (277) 

The extension-in-velocity between any two limits of kinetic energy 
e' p and e" may be represented by the integral 



e^ p de p . 


(278) 
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And in general, we may substitute e^ p de p for A p dpi ... dp n or 
1 

A? dip ... dq n in an n-fold integral in which the coordinates are con¬ 
stant, reducing it to a simple integral, when the limits are expressed 
by the kinetic energy, and the other factor under the integral sign 
is a function of the kinetic energy, either alone or with quantities 
which are constant in the integration. 

It is easy to express V p and in terms of e p . Since A p is function 
of the coordinates alone, we have by definition 


Vp = Af 


dpi ... dp r 


(279) 


the limits of the integral being given by e p . That is, if 


e p = F(p u ...p n ), 


(280) 


the limits of the integral for e p — 1, are given by the equation 


F(p 1 ,...p„) = 1, 


(281) 


and the limits of the integral for e p = a 2 , are given by the equation 


F(pi ,. .. p n ) = a 2 . 


(282) 


But since F represents a quadratic function, this equation may be 
written 

F 1. (283) 


a a 


The value of V p may also be put in the form 


V p = a n Al 


rP 1 ,Pn 

d — ... d —. 
a a 


(284) 
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Now we may determine V p for e p — 1 from(279) where the limits 
are expressed by (281), and V p for e p = a 2 from (284) taking the 
limits from (283). The two integrals thus determined are evidently 
identical, and we have 


(V P h p=a 2 = a n (V p ) ep=l , (285) 

n 

i.e., V p varies as e p . We may therefore set 

- n 

V p = Ce p , e+* = - Ce p (286) 

where C is a constant, at least for fixed values of the internal 
coordinates. 

To determine this constant, let us consider the case of a canonical 
distribution, for which we have 


where 


e e 


+<t> P 


d,6 p 1, 


e e = (27T0) 2 . 

Substituting this value, and that of (V' p from (286), we get 


n 


5e tt-1 


-C I e ® e p de p = (27r0) 2 , 






Hg)=(2^)L 


n 


n 


-CT - =( 2 tt ) 2 , 


c = 


n 

(2t)2 

T(|n + 1)' 


( 287 ) 
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Having thus determined the value of the constant C, we may 
substitute it in the general expressions (286), and obtain the fol¬ 
lowing values, which are perfectly general: 

n 

_ (2ire p ) 2 

P r(|n + i)’ 

n — _i 

r (|n) 

It will be observed that the values of V p and </> p for any given e p 
are independent of the configuration, and even of the nature of the 
system considered, except with respect to its number of degrees of 
freedom. 

Returning to the canonical ensemble, we may express the prob¬ 
ability that the kinetic energy of a system of a given configuration, 

*Very similar values for V q . e^ q , V, and may be found in the same way 
in the case discussed in the preceding foot-notes (see pages 61, 84, 90, and 93), 
in which e q is a quadratic function of the q’’ s, and A^ independent of the q’s. 
In this case we have 

i n n 

/M 5 (2tt) 2 ( e q - e a ) 2 

UJ r(i» + i) ’ 

in 71 

f M 5 (2tt) 2 (e 9 - e a )2 

\A q J r(in) 

(2tt)"( e-e Q )" 

UJ r(n+l) ’ 

(27r)"(e^e a r~ 1 

\A q ) T(n) 



(288) 

(289) 
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but otherwise unspecified, falls within given limits, by either mem¬ 
ber of the following equation 


/ 1pp—£p 

' e 


+0P de p 


1 

r(p 



(290) 


Since this value is independent of the coordinates it also represents 
the probability that the kinetic energy of an unspecified system of 
a canonical ensemble falls within the limits. The form of the last 
integral also shows that the probability that the ratio of the kinetic 
energy to the modulus falls within given limits is independent also of 
the value of the modulus, being determined entirely by the number 
of degrees of freedom of the system and the limiting values of the 
ratio. 

The average value of any function of the kinetic energy, either 
for the whole ensemble, or for any particular configuration, is given 
by 

1 r°° r*_i 

u = —n - / ue © €p de p * (291) 

0 2 T(|n) do 


*The corresponding equation for the average value of any function of the 
potential energy, when this is a quadratic function of the q’ s, and is inde¬ 
pendent of the q' s, is 

1 f°° n 

u = n - / ue e (e 9 -e Q )2 de q . 

In the same case, the average value of any function of the (total) energy is given 
by the equation 


u = 


0 n r(n) J Q 


u e 


e—e a 

© 


(e-e Q ) n 1 de. 
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Thus: 


_ T (m + ^n) 

r«n) 


- 


0 m , if m + yn > 0;* 


v = __ 

p T(|n + 1 )rQn) 


( 2710 ) 2 ; 


(292) 

(293) 


-x- r (™- !) / n _w 


gY'p — 


[T(Jn)]2 

d(f>p 


(2tt) 2 02- 1 , if tt, > 1; 


de p 0 


= —, if n > 2; 


e-'t’p V p = 0. 


Hence in this case 


(e g - e a ) m = 


T(to + |n) 
T(|n) 


0 m , if to + in > 0. 


(e~e a ) m = T{ ™\ n) Q m , if to + n > 0. 
T(n) 


e ”14 = e _, 4 V = 0, 


d(f) q 


de q 0 


= —, if n > 2, 


and 


# 1 -f ^ i 

= —, it n > 1. 


de 0 ! 


(294) 

(295) 

(296) 


If n = 1, = 27r and d<f>/de = 0 for any value of e. If n = 2, the case is the 

same with respect to (j) q . 

*This equation has already been proved for positive integral powers of the 
kinetic energy. See page 90. 
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If n = 2, e^ p = 2 ti, and dxj) p /de p = 0, for any value of e p . 
The definitions of V, V q , and V p give 

V = [[ dV p dV q , 


(297) 


where the integrations cover all phases for which the energy is less 
than the limit e, for which the value of V is sought. This gives 


V = 


V p dV q , 


(298) 


and 


e Y = 


dV 

de 


e ^ d.V q , 


where V p and e^ p are connected with V q by the equation 


€ P + e q = constant = e. 


(299) 


(300) 


If n > 2, e^ p vanishes at the upper limit, i.e., for e p = 0, and we 
get by another differentiation 


dcj) 


e* pd ?^dV n . 


de Jv q =o de p 


(301) 


We may also write 


1 / = 


e r = 


V p e^ q de qi 


P + <t>q rip 

O/Cgf, 


(302) 

(303) 
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etc., when V q is a continuous function of e q commencing with the 
value V q = 0, or when we choose to attribute to V q a fictitious con¬ 
tinuity commencing with the value zero, as described on page 105. 

If we substitute in these equations the values of V p and e^ v which 
we have found, we get 


n 



n 

Zq) 2 dV qi 


2-' dV q , 


(304) 

(305) 


where e^ q de q may be substituted for dV q in the cases above de¬ 
scribed. If, therefore, n is known, and V q as function of e qi V and e^ 
may be found by quadratures. 

It appears from these equations that V is always a continuous 
increasing function of e, commencing with the value V = 0, even 
when this is not the case with respect to V q and e q . The same is 
true of e^, when n > 2, or when n = 2 if V q increases continuously 
with e q from the value V q = 0. 

The last equation may be derived from the preceding by dif¬ 
ferentiation with respect to e. Successive differentiations give, if 
h <\n + 1, 


d h V 
de h 



d h V p 

dtp 


dVn 


n 

(2tt)2 

T (|n + 1 — h) 



n u 

e q )i~ h dV q . 

(306) 


d h V/de h is therefore positive if h < \n + 1. It is an increasing 
function of e, if h < \n. If e is not capable of being diminished 
without limit, d h V/de h vanishes for the least possible value of e, if 
h < \n. 
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If n is even, 


d^V 
de 2 




(307) 


That is, V q is the same function of e q as 


d 2V 


of e. 


(2n) 2 de 2 

When n is large, approximate formulae will be more available. It 
will be sufficient to indicate the method proposed, without precise 
discussion of the limits of its applicability or of the degree of its 
approximation. For the value of corresponding to any given e, 
we have 

re q =e ft 

de p , (308) 


e^ = 


e 4>p+4>q de q = 


where the variables are connected by the equation (300). The max¬ 
imum value of (f) p + ef) q is therefore characterized by the equation 

d(j) p d(j) q 


de r , 


de n 


(309) 

L -p ^<1 

The values of e p and e q determined by this maximum we shall dis¬ 
tinguish by accents, and mark the corresponding values of functions 
of e p and e q in the same way. Now we have by Taylor’s theorem 


-*=■w (^y - o+ 


de p 

_ 1 / 1 / dcfrq 


d 4 


(cp - O 2 


+ etc. (310) 


€ 'q ) + 


de\ 


d 2 (p q Y (e Q -e') 2 


+ etc. (311) 


If the approximation is sufficient without going beyond the quadratic 
terms, since by (300) 

_ e p' — _ ( e 9 _ ^q)i 
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we may write 



J —OO 


where the limits have been made ±oo for analytical simplicity. This 
is allowable when the quantity in the square brackets has a very 
large negative value, since the part of the integral corresponding 
to other than very small values of e q — e' q may be regarded as a 
vanishing quantity. 

This gives 



From this equation, with (289), (300) and (309), we may determine 
the value of (j) corresponding to any given value of e, when (p q is 
known as function of e q . 

Any two systems may be regarded as together forming a third 
system. If we have V or 0 given as function of e for any two systems, 
we may express by quadratures V and 0 for the system formed by 
combining the two. If we distinguish by the suffixes ( )i, ( ) 2 , ( )i 2 
the quantities relating to the three systems, we have easily from the 
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definitions of these quantities 



IV 2 de 2 , (315) 




e *i+fc de ^ ( 316 ) 


where the double integral is to be taken within the limits 

V\ = 0, V ‘2 — 0, and ei + e 2 = e± 2 , 

and the variables in the single integrals are connected by the last of 
these equations, while the limits are given by the first two, which 
characterize the least possible values of e\ and e 2 respectively. 

ft will be observed that these equations are identical in form 
with those by which V and (j) are derived from V p or cj) p and V q or 
<j) q , except that they do not admit in the general case those transfor¬ 
mations which result from substituting for V p or (j) p the particular 
functions which these symbols always represent. 

Similar formulae may be used to derive V q or (f) q for the com¬ 
pound system, when one of these quantities is known as function of 
the potential energy in each of the systems combined. 

The operation represented by such an equation as 


e 012 = 


e^e 4 * de 1 


is identical with one of the fundamental operations of the theory 
of errors, viz., that of finding the probability of an error from the 
probabilities of partial errors of which it is made up. It admits a 
simple geometrical illustration. 

We may take a horizontal line as an axis of abscissas, and 
lay off €i as an abscissa measured to the right of any origin, and 
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erect e^ 1 as a corresponding ordinate, thus determining a certain 
curve. Again, taking a different origin, we may lay off £2 as abscissas 
measured to the left, and determine a second curve by erecting the 
ordinates e^ 2 . We may suppose the distance between the origins 
to be 612 , the second origin being to the right if ei 2 is positive. 
We may determine a third curve by erecting at every point in the 
line (between the least values of e\ and 62 ) an ordinate which rep¬ 
resents the product of the two ordinates belonging to the curves 
already described. The area between this third curve and the axis 
of abscissas will represent the value of e^ 12 . To get the value of 
this quantity for varying values of e 12 , we may suppose the first 
two curves to be rigidly constructed, and to be capable of being 
moved independently. We may increase or diminish 612 by moving 
one of these curves to the right or left. The third curve must be 
constructed anew for each different value of 612 - 



IX. 


THE FUNCTION 0 AND THE CANONICAL DISTRIBUTION. 

In this chapter we shall return to the consideration of the canonical 
distribution, in order to investigate those properties which are espe¬ 
cially related to the function of the energy which we have denoted 
by 0. 

If we denote by N, as usual, the total number of systems in the 
ensemble, 

b-e_i , 

N e _ © _+ ^ de 

will represent the number having energies between the limits e and 
e + de. The expression 

AT e V+^ (317) 

represents what may be called the density-in-energy. This vanishes 
for e = oo, for otherwise the necessary equation 

re=oo xj)—e 

/ e~Q~ +<t> de = 1 (318) 

Jv =o 

could not be fulhlled. For the same reason the density-in-energy 
will vanish for e = —oo, if that is a possible value of the energy. 
Generally, however, the least possible value of the energy will be a 
finite value, for which, if n > 2 , will vanish,* and therefore the 
density-in-energy. Now the density-in-energy is necessarily positive, 
and since it vanishes for extreme values of the energy if n > 2 , it 
must have a maximum in such cases, in which the energy may be 
said to have its most common or most probable value, and which is 


See page 113. 
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determined by the equation 


dcf) 1 

de 0 


(319) 


This value of d(j)/de is also, when n > 2, its average value in the 
ensemble. For we have identically, by integration by parts, 


I v=o de 


ip—e 

e~e~ 


+</> 


v=o 


1 f e 00 i/>-e , 

+ ^ / ee+^de. (320) 
0 Jv=o 


If n > 2, the expression in the brackets, which multiplied by N 
would represent the density-in-energy, vanishes at the limits, and 
we have by (269) and (318) 


d(j) 1 

~HT = ©' 

It appears, therefore, that for systems of more than two degrees 
of freedom, the average value of d(j>/de in an ensemble canonically 
distributed is identical with the value of the same differential coef¬ 
ficient as calculated for the most common energy in the ensemble, 
both values being reciprocals of the modulus. 

Hitherto, in our consideration of the quantities V, V q , V p , </>, 
4> q , (j) p , we have regarded the external coordinates as constant. It 
is evident, however, from their definitions that V and (j) are in gen¬ 
eral functions of the external coordinates and the energy (e), that 
V q and (j) q are in general functions of the external coordinates and 
the potential energy (e q ). V p and <p p we have found to be functions 
of the kinetic energy (e p ) alone. In the equation 



e © = 


e 


©+</> 


de, 


v=o 


(322) 
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by which -0 may be determined, 0 and the external coordinates 
(contained implicitly in 0) are constant in the integration. The 
equation shows that 0 is a function of these constants. If their 
values are varied, we shall have by differentiation, if n > 2, 


t ( 1 0 \ 1 f e=0 ° e 

^{-e i ^Wi d °) = e* de L ee ^ de 

+ dchi f —^-e~e> + ^de + da 2 [ -j -— e _ 0 +< ^de + etc., (323) 

Jv =o “ a i 0v=o da 2 

(Since e^ 1 vanishes with V, when n > 2, there are no terms due to 
the variations of the limits.) Hence by (269) 




€ d(b 1 dd> . ^ 

— <i0 + —-— H - — j — dd 2 + etc., (324) 

0 dai d&2 


or, since 


^0— = d, (325) 

dip — fj dS — © —r~~ da 1 — © —— da 2 — etc. (326) 


d.a\ 

Comparing this with (112), we get 
d(j) Ai d(j) 


da\ 


Ai 

0 ’ 


dcio 


dci 2 


A 2 


etc. 


(327) 
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The first of these equations might be written* 


de 


f d0 \ _ _fd(f) 

\dai) \dej n \dai 


a,q 


but must not be confounded with the equation 


dcf) 

dci\ 


dcj) 

de 


de 

dd\ 


(f>,a 


which is derived immediately from the identity 

'dej)\ f de 


def) 

da± 


de 


da i 


0 , 0 , 


(328) 


(329) 


(330) 


Moreover, if we eliminate d'lp from (326) by the equation 

d'lp = 0 drj + fj dQ + de, (331) 

obtained by differentiating (325), we get 

de = —0 drj — O ^ da\ — 0 ^ da 2 — etc., (332) 

d&\ dci2 

or by (321), 

—drj = —y— de H—- da± H—- da 2 + etc. (333) 
de da i da 2 

*See equations (321) and (104). Suffixes are here added to the differential 
coefficients, to make the meaning perfectly distinct, although the same quanti¬ 
ties may be written elsewhere without the suffixes, when it is believed that there 
is no danger of misapprehension. The suffixes indicate the quantities which are 
constant in the differentiation, the single letter a standing for all the letters a ±, 
0 , 2 , etc., or all except the one which is explicitly varied. 
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Except for the signs of average, the second member of this equation 
is the same as that of the identity 

do <l< - -y— dai + —— da 2 + etc. (334) 

dc dd\ d&2 

For the more precise comparison of these equations, we may sup¬ 
pose that the energy in the last equation is some definite and fairly 
representative energy in the ensemble. For this purpose we might 
choose the average energy. It will perhaps be more convenient to 
choose the most common energy, which we shall denote by eo- The 
same suffix will be applied to functions of the energy determined 
for this value. Our identity then becomes 


,, t d< f > 
^„=l- 


de o + 


It has been shown that 



d(i\ -\- 
o 


/ dtp \ 

—— da 2 + etc. 

V da-2 J 0 


(335) 


d(j) (d(p\ 1 

de \de J 0 O’ 

when n > 2. Moreover, since the external coordinates have con¬ 
stant values throughout the ensemble, the values of d<j)/dai, d(j)/da 2 , 
etc. vary in the ensemble only on account of the variations of the 
energy (e), which, as we have seen, may be regarded as sensibly 
constant throughout the ensemble, when n is very great. In this 
case, therefore, we may regard the average values 



d(j) 

da\ 


d(p 

da2 


etc., 


as practically equivalent to the values relating to the most common 
energy 


/ d(f>\ f d(f)\ 

ydcii J q \da2 J q 


etc. 
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In this case also de is practically equivalent to de 0 . We have there¬ 
fore, for very large values of n, 

—dr) = dtp o (337) 


approximately. That is, except for an additive constant, —fj may 
be regarded as practically equivalent to 0o> when the number of 
degrees of freedom of the system is very great. It is not meant by 
this that the variable part of fj + 0 O is numerically of a lower order 
of magnitude than unity. For when n is very great, —fj and 0 O are 
very great, and we can only conclude that the variable part of 7/ + 0 o 
is insignificant compared with the variable part of fj or of 0o, taken 
separately. 

Now we have already noticed a certain correspondence between 
the quantities 0 and fj and those which in thermodynamics are 
called temperature and entropy. The property just demonstrated, 
with those expressed by equation (336), therefore suggests that the 
quantities 0 and de/dcp may also correspond to the thermodynamic 
notions of entropy and temperature. We leave the discussion of 
this point to a subsequent chapter, and only mention it here to 
justify the somewhat detailed investigation of the relations of these 
quantities. 

We may get a clearer view of the limiting form of the relations 
when the number of degrees of freedom is indefinitely increased, if 
we expand the function 0 in a series arranged according to ascending 
powers of e — 6 q. This expansion may be written 


0 = 00 + 



(e-eo) + 


/cP0\ (e ~ ep) 2 

\de 2 ) 0 2 



d>) 5 


+etc. 


( 338 ) 


o 


3! 
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Adding the identical equation 


0 - e _ 0 - e o e-e 0 

0 “ 0 0 


we get by (336) 


0 — e 


0 


0 - e 0 

0 


h0 0+ 


/ cp0\ (e — e 0 ) 2 

\*V 0 2 


Substituting this value in 


v de 3 y o 


(g ~ ^o ) 3 

3! 


fete. 


(339) 



i/j—e 

e~e~ 


+<p de, 


which expresses the probability that the energy of an unspecified 
system of the ensemble lies between the limits e' and e", we get 


f-eo 
e © 


+</> 0 


re" ( (e~eo ) 2 

eU 2 Jo 2 



h-^o) 3 

3 ! 


+etc. 


de. 


(340) 


When the number of degrees of freedom is very great, and e — eo 
in consequence very small, we may neglect the higher powers and 
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write* 


0-ep 

e © 


+4> o 


e 


d 2 <p \ (d-e 0 ) 2 

*V. 2 de. 


(341) 


This shows that for a very great number of degrees of free¬ 
dom the probability of deviations of energy from the most prob¬ 
able value (eo) approaches the form expressed by the ‘law of errors.’ 
With this approximate law, we get 


whence 


e © 


+00 


-2t r 

\ de2 )o 


e — e 0; ( e ~~ e o) 2 — 



f ^ 2 <A 

\ de2 )o 


0 - ep 

0 


+ 00 


^log 


( 

\dt 2 ) o 

-2vr 


-llog(27r(e-e) 2 ). 


(342) 


(343) 


(344) 


*If a higher degree of accuracy is desired than is afforded by this formula, 
it may be multiplied by the series obtained from 


e 



b-eo) 3 

3 ! 


+etc. 


by the ordinary formula for the expansion in series of an exponential function. 
There would be no especial analytical difficulty in taking account of a moderate 
number of terms of such a series, which would commence 


f (e~ eo) 3 
\de 3 ) o 3! 


(e-eo) 4 


o 


4! 


+ etc. 
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Now it has been proved in Chapter VII that 


We have therefore 



2 de _2 
n de p p 


r l - <t>0 - + <t>0 = -5 log(2ir(e - e) 2 ) = “I lo S (jj Jr 

"(345) 

approximately. The order of magnitude of rj — 0o is therefore that 
of logn. This magnitude is mainly constant. The order of magni¬ 
tude of rj T 0o — g log n is that of unity. The order of magnitude 
of 0o, and therefore of —rj, is that of n* 

Equation (338) gives for the first approximation 


= 


(0-0o)(e-eo) = 


( (e ~ eo ) 2 

\de 2 J o 2 

-eo) 2 


0 


i 

~ 2 ’ 


de 

de v p ’ 


(0 - 0 q )2 = 


(e — e 0 ) 2 n de 


02 


2 de n 


(346) 

(347) 

(348) 


The members of the last equation have the order of magnitude of n. 
Equation (338) gives also for the first approximation 


whence 


d(j> 1 

de 0 


( 

V de 2 ) o 


(e-eo), 


( 


dcp 

de 



(e - e o) 


( d2 ^ \ 

V de 2 ) o 


(e - e 0 ) 2 


- 1 , 


(349) 


* Compare (289), (314). 
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_ iv = f **y 7 ——^ = 1 = ^ ^y 

U ©; Uv 0 le eoJ (r^ Uv 0 

This is of the order of magnitude of n* 

ft should be observed that the approximate distribution of the 
ensemble in energy according to the ‘law of errors’ is not dependent 
on the particular form of the function of the energy which we have 
assumed for the index of probability {rj). In any case, we must have 

re=oo 

/ e r,+<?J de = 1, (351) 

Jv =o 

where e v+ ^ is necessarily positive. This requires that it shall vanish 
for e = oo, and also for e = — oo, if this is a possible value. It has 
been shown in the last chapter that if e has a (finite) least possible 
value (which is the usual case) and n > 2, will vanish for that 
least value of e. In general therefore rj + (j> will have a maximum, 
which determines the most probable value of the energy. If we 
denote this value by eo, and distinguish the corresponding values of 
the functions of the energy by the same suffix, we shall have 



drj 


de 


dcj) 


-7 1 + wH =0. 


de 


*We shall find hereafter that the equation 


d(f> 

de 


(e —e) = —1 

is exact for any value of n greater than 2, and that the equation 


_ ^\ 2 _ d 2 b 

\de 0 ) de 2 


(352) 


is exact for any value of n greater than 4. 
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The probability that an unspecified system of the ensemble falls 
within any given limits of energy (e' and e") is represented by 

r" 

/ e v+<l> de. 


If we expand ri and </> in ascending powers of e — e 0 , without going 
beyond the squares, the probability that the energy falls within the 
given limits takes the form of the ‘law of errors’— 


^o+vo 


d 2r q 
de 2 


d 2 (j) 

de 2 


h-eo ) 2 


de. 


(353) 


This gives 

ho + </>o = \ log 

and 


1 / d 2 r] 
2 tt l de 2 


(e - eo ) 2 = 


d 2 r) 

Id 2 


1 / d 2 (j)\ 
2 tt V de 2 ) a 


d 2 (p\ 
de 2 ), 


-i 


(354) 


(355) 


We shall have a close approximation in general when the quantities 
equated in (355) are very small, i.e., when 


_ ( _ ( d2 ^ \ 

\de 2 ) 0 V* 2 / 0 

is very great. Now when n is very great, —dde^/de 2 is of the same 
order of magnitude, and the condition that (356) shall be very great 
does not restrict very much the nature of the function r]. 

We may obtain other properties pertaining to average values in 
a canonical ensemble by the method used for the average of dcf)/de. 
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Let u be any function of the energy, either alone or with 0 and 
the external coordinates. The average value of u in the ensemble is 
determined by the equation 


u = 


1 

ue e ^ de. 


'v=o 


(357) 


Now we have identically 



u 

0 



ip—e 

e~e~ 


+<f> 


de 


ip—e 

ue © 


+0 


v=o 


(358) 


Therefore, by the preceding equation 


du 

de 


u deb 

e + “* = 


ue © +0 


v=o 


(359) 


If we set u = 1, (a value which need not be excluded,) the second 
member of this equation vanishes, as shown on page 118, if n > 2, 
and we get 


(360) 


dcj) 1 
~d~e = Q' 

as before. It is evident from the same considerations that the second 
member of (359) will always vanish if n > 2, unless u becomes 
infinite at one of the limits, in which case a more careful examination 
of the value of the expression will be necessary. To facilitate the 


*A more general equation, which is not limited to ensembles canonically 
distributed, is 


du 

de 


dri deb 




where rj denotes, as usual, the index of probability of phase. 
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discussion of such cases, it will be convenient to introduce a certain 
limitation in regard to the nature of the system considered. We have 
necessarily supposed, in all our treatment of systems canonically 
distributed, that the system considered was such as to be capable 
of the canonical distribution with the given value of the modulus. 
We shall now suppose that the system is such as to be capable of a 
canonical distribution with any (finite)* modulus. Let us see what 
cases we exclude by this last limitation. 

The impossibility of a canonical distribution occurs when the 
equation 



(361) 


fails to determine a finite value for 0. Evidently the equation can¬ 
not make 0 an infinite positive quantity, the impossibility therefore 
occurs when the equation makes 0 = — oo. Now we get easily 
from (191) 

4--i» 

If the canonical distribution is possible for any values of 0, we can 
apply this equation so long as the canonical distribution is possible. 
The equation shows that as 0 is increased (without becoming in¬ 
finite) —0 cannot become infinite unless e simultaneously becomes 
infinite, and that as 0 is decreased (without becoming zero) —0 can¬ 
not become infinite unless simultaneously e becomes an infinite neg¬ 
ative quantity. The corresponding cases in thermodynamics would 
be bodies which could absorb or give out an infinite amount of heat 
without passing certain limits of temperature, when no external 
work is done in the positive or negative sense. Such infinite values 

*The term finite applied to the modulus is intended to exclude the value 
zero as well as infinity. 
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present no analytical difficulties, and do not contradict the general 
laws of mechanics or of thermodynamics, but they are quite foreign 
to our ordinary experience of nature. In excluding such cases (which 
are certainly not entirely devoid of interest) we do not exclude any 
which are analogous to any actual cases in thermodynamics. 

We assume then that for any finite value of 0 the second member 
of (361) has a finite value. 

When this condition is fulfilled, the second member of (359) will 
vanish for u = e~^V. For, if we set 0' = 20, 

_± r * -j- r -j-m* 

e e V — e e / e 0 de < e ©' / e ©'de < e &' e ©', 

Jv =o Jv =o 

where denotes the value of "0 for the modulus 0'. Since the 
last member of this formula vanishes for e = oo, the less value 
represented by the first member must also vanish for the same value 
of e. Therefore the second member of (359), which differs only by a 
constant factor, vanishes at the upper limit. The case of the lower 
limit remains to be considered. Now 

e~bv < [ e~^ +4> de. 

Jv =o 

The second member of this formula evidently vanishes for the value 
of e, which gives V = 0, whether this be finite or negative infinity. 
Therefore, the second member of (359) vanishes at the lower limit 
also, and we have 


1 - e-^V + e-^V = 0 , 

de 0 de 


or 


e~*V = 0. 


(362) 
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This equation, which is subject to no restriction in regard to the 
value of n, suggests a connection or analogy between the function 
of the energy of a system which is represented by e~^V and the 
notion of temperature in thermodynamics. We shall return to this 
subject in Chapter XIV. 

If n > 2, the second member of (359) may easily be shown to 
vanish for any of the following values of u viz.: p, e^, e, e m , where 
m denotes any positive number. It will also vanish, when n > 4, 
for u = d(j)/de , and when n > 2 h for u = e~^ d h V/de h . When the 
second member of (359) vanishes, and n > 2, we may write 


(u — u ) 



1 \ d(f) u 
Qj =U Te~Q 


du 

de 


We thus obtain the following equations: 
If n > 2, 


( 0 - 0 ) 




(10 0 1 
^ Te ~ 0 = ^ 0 ’ 


(363) 


(364) 


or 


(e* 




, deb e ^ 

p<p — - 

de 0 



(e- 


c) 



dcj) 

de 


e <t> 

0’ 


(deft 1 \ dtp e 

\ de 0 / de 0 


- 1 , 


* 





6 m 

"0 


= —me” 1 " 1 . 


(365) 


(366) 

(367) 

(368) 
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If n > 4, 


f dcj) 1\ 2 f d(j)\ 2 1 d?(f) * * 

= V*y = 


(369) 


If n > 2 h 


. d h V d(j) 1 , d h V . d h V d<j> 

0 -— — 6 — 0 - = 0 - 

de h de 0 de h de h fie 

or 


, d h+l V 1 . d h V 

p~<p -— — p—<t> - 

de h+1 0 de h ’ 
whence _ 


ci /l+1 I/ _ 1 
de h+1 0 h 

Giving h the values 1, 2, 3, etc., we have 


e -<t> 


d h+1 V 

de h+1 


(370) 

(371) 


dcj) 1 
~d~e = Q 1 
d 2 cj) f dcj >\ 2 
de 2 + \de ) 


if n > 2, 


1 

02 ’ 


if n > 4, 


as already obtained. Also 


d 3 cj) d 2 cf) dcj) / dcj)\ 3 

de 3 de 2 de \de) 


1 

03 ’ 


if n > 6. 


(372) 


*This equation may also be obtained from equations (252) and (321). Com¬ 
pare also equation (349) which was derived by an approximative method. 

* Compare equation (360), obtained by an approximative method. 
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If V q is a continuous increasing function of e q , commencing with 
V q = 0, the average value in a canonical ensemble of any function 
of e q , either alone or with the modulus and the external coordinates, 
is given by equation (275), which is identical with (357) except that 
e, 0, and 0 have the suffix ( ) q . The equation may be transformed 
so as to give an equation identical with (359) except for the suffixes. 
If we add the same suffixes to equation (361), the finite value of its 
members will determine the possibility of the canonical distribution. 

From these data, it is easy to derive equations similar to (360), 
(362)-(372), except that the conditions of their validity must be 
differently stated. The equation 

e-hVg = 0 

requires only the condition already mentioned with respect to V q . 
This equation corresponds to (362), which is subject to no restric¬ 
tion with respect to the value of n. We may observe, however, that 
V will always satisfy a condition similar to that mentioned with 
respect to V q . 

If V q satisfies the condition mentioned, and e^ q a similar condi¬ 
tion, i.e., if e^ q is a continuous increasing function of e q , commencing 
with the value e ( p q = 0, equations will hold similar to those given for 
the case when n > 2, viz., similar to (360), (364)-(368). Especially 
important is 

d(f) q 1 

de q 0 

If V q , e^ q , (or dV q /de q ), d 2 V q /de 2 all satisfy similar conditions, we 
shall have an equation similar to (369), which was subject to the 
condition n > 4. And if d 3 V q /de 3 also satisfies a similar condition, 
we shall have an equation similar to (372), for which the condition 
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was n > 6. Finally, if V q and h successive differential coefficients 
satisfy conditions of the kind mentioned, we shall have equations 
like (370) and (371) for which the condition was n > 2 h. 

These conditions take the place of those given above relating 
to n. In fact, we might give conditions relating to the differential 
coefficients of V, similar to those given relating to the differential 
coefficients of V q , instead of the conditions relating to n, for the va¬ 
lidity of equations (360), (363)-(372). This would somewhat extend 
the application of the equations. 


X. 


ON A DISTRIBUTION IN PHASE CALLED 
MICROCANONICAL IN WHICH ALL THE SYSTEMS HAVE 
THE SAME ENERGY. 

An important case of statistical equilibrium is that in which all 
systems of the ensemble have the same energy. We may arrive at the 
notion of a distribution which will satisfy the necessary conditions 
by the following process. We may suppose that an ensemble is 
distributed with a uniform density-in-phase between two limiting 
values of the energy, e' and e", and with density zero outside of 
those limits. Such an ensemble is evidently in statistical equilibrium 
according to the criterion in Chapter IV, since the density-in-phase 
may be regarded as a function of the energy. By diminishing the 
difference of e' and e", we may diminish the differences of energy 
in the ensemble. The limit of this process gives us a permanent 
distribution in which the energy is constant. 

We should arrive at the same result, if we should make the den¬ 
sity any function of the energy between the limits e' and e", and 
zero outside of those limits. Thus, the limiting distribution obtained 
from the part of a canonical ensemble between two limits of energy, 
when the difference of the limiting energies is indefinitely dimin¬ 
ished, is independent of the modulus, being determined entirely by 
the energy, and is identical with the limiting distribution obtained 
from a uniform density between limits of energy approaching the 
same value. 

We shall call the limiting distribution at which we arrive by this 
process microcanonical. 

We shall find however, in certain cases, that for certain values 
of the energy, viz., for those for which e^ is infinite, this process 
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fails to define a limiting distribution in any such distinct sense as 
for other values of the energy. The difficulty is not in the process, 
but in the nature of the case, being entirely analogous to that which 
we meet when we try to find a canonical distribution in cases when 
^ becomes infinite. We have not regarded such cases as affording 
true examples of the canonical distribution, and we shall not regard 
the cases in which is infinite as affording true examples of the 
microcanonical distribution. We shall in fact find as we go on that 
in such cases our most important formulae become illusory. 

The use of formulae relating to a canonical ensemble which 
contain de instead of dpi... dq n , as in the preceding chapters, 
amounts to the consideration of the ensemble as divided into an 
infinity of microcanonical elements. 

From a certain point of view, the microcanonical distribution 
may seem more simple than the canonical, and it has perhaps been 
more studied, and been regarded as more closely related to the 
fundamental notions of thermodynamics. To this last point we shall 
return in a subsequent chapter. It is sufficient here to remark that 
analytically the canonical distribution is much more manageable 
than the microcanonical. 

We may sometimes avoid difficulties which the microcanonical 
distribution presents by regarding it as the result of the following 
process, which involves conceptions less simple but more amenable 
to analytical treatment. We may suppose an ensemble distributed 
with a density proportional to 

e u 2 


where u and e' are constants, and then diminish indefinitely the 
value of the constant uj. Here the density is nowhere zero until we 
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come to the limit, but at the limit it is zero for all energies ex¬ 
cept e'. We thus avoid the analytical complication of discontinuities 
in the value of the density, which require the use of integrals with 
inconvenient limits. 

In a microcanonical ensemble of systems the energy (e) is con¬ 
stant, but the kinetic energy ( e p ) and the potential energy ( e q ) vary 
in the different systems, subject of course to the condition 

e p + e q = e = constant. (373) 

Our first inquiries will relate to the division of energy into these two 
parts, and to the average values of functions of e p and e q . 

We shall use the notation u\ e to denote an average value in a 
microcanonical ensemble of energy e. An average value in a canon¬ 
ical ensemble of modulus O, which has hitherto been denoted by u, 
we shall in this chapter denote by u| 0 , to distinguish more clearly 
the two kinds of averages. 

The extension-in-phase within any limits which can be given in 
terms of e p and e q may be expressed in the notations of the preceding 
chapter by the double integral 


dV p dV q 


taken within those limits. If an ensemble of systems is distributed 
within those limits with a uniform density-in-phase, the average 
value in the ensemble of any function (w) of the kinetic and potential 
energies will be expressed by the quotient of integrals 
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Since dV p = e^ p de p , and de p = de when e q is constant, the 
expression may be written 

u e^ p de dV q 
e^ p de dV q 

To get the average value of u in an ensemble distributed rnicro- 
canonically with the energy e, we must make the integrations cover 
the extension-in-phase between the energies e and e + de. This gives 

re q =e 

de u e^ p dV q 

—, JVn = 0 

u L = — 

de 

But by (299) the value of the integral in the denominator is e^. We 
have therefore 

_ r e i =e 

u\ e = e~+ ue* p dVq, (374) 

Jv q =o 

where e^ p and V q are connected by equation (373), and u, if given 
as function of e p , or of e p and e qi becomes in virtue of the same 
equation a function of e q alone. 

We shall assume that has a finite value. If n > 1, it is 

evident from equation (305) that is an increasing function of e, 

and therefore cannot be infinite for one value of e without being 
infinite for all greater values of e, which would make — ij} infinite.* 
When n > 1, therefore, if we assume that is finite, we only 
exclude such cases as we found necessary to exclude in the study 





See equation (322). 
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of the canonical distribution. But when n — 1, cases may occur in 
which the canonical distribution is perfectly applicable, but in which 
the formulae for the microcanonical distribution become illusory, 
for particular values of e, on account of the infinite value of ev 
Such failing cases of the microcanonical distribution for particular 
values of the energy will not prevent us from regarding the canonical 
ensemble as consisting of an infinity of microcanonical ensembles.* 
From the last equation, with (298), we get 


e~^V p | = e 


' v=o 


VpdVg = e-tv. 


But by equations (288) and (289) 


Therefore 


e ^ P V P = —e„. 
n 


e = e~^pVr, I = — e 


n 


p i e • 


Again, with the aid of equation (301), we get 




= e-* r e ^dv,= f, 

Jv =o de p de 


(375) 

(376) 

(377) 

(378) 


* An example of the failing case of the microcanonical distribution is afforded 
by a material point, under the influence of gravity, and constrained to remain 
in a vertical circle. The failing case occurs when the energy is just sufficient to 
carry the material point to the highest point of the circle. 

It will be observed that the difficulty is inherent in the nature of the case, 
and is quite independent of the mathematical formulae. The nature of the 
difficulty is at once apparent if we try to distribute a finite number of material 
points with this particular value of the energy as nearly as possible in statistical 
equilibrium, or if we ask: What is the probability that a point taken at random 
from an ensemble in statistical equilibrium with this value of the energy will be 
found in any specified part of the circle? 
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if n > 2. Therefore, by (289) 


d(j) d(j) p 

de den 


n 


= _ - 1 e 


c-X 


if n > 2. 


(379) 


These results are interesting on account of the relations of 

the functions e~^V and — to the notion of temperature in 

de 

thermodynamics,—a subject to which we shall return hereafter. 
They are particular cases of a general relation easily deduced from 
equations (306), (374), (288) and (289). We have 


d h V 

de h 


d h V v 

de 1 ! 


dV q , if h < \n + 1. 


JV =0 

The equation may be written 


d h V 

de h 


= e 


We have therefore 


d h V . d h V„ 

- = f~ < Pv -— 

de h p 


de h 


d h V n 


iv =o 


p -VV _ —p-tfip fix/ 


r(in) 


T Un — h + 1) 


c h— 1 


(380) 


if h < + 1. For example, when n is even, we may make h = | n, 

which gives, with (307), 


1— 7T 


(2tt) 2e *(V) =T/n)( r 2 


(381) 


Since any canonical ensemble of systems may be regarded as 
composed of microcanonical ensembles, if any quantities u and v 
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have the same average values in every microcanonical ensemble, 
they will have the same values in every canonical ensemble. To 
bring equation (380) formally under this rule, we may observe that 
the first member being a function of e is a constant value in a 
microcanonical ensemble, and therefore identical with its average 
value. We get thus the general equation 


- e -* d " v v 

rU™) c „_l 1 

de h 

© at P 

0 T{\n — h+ 1) p 


d h V 

de h 


if h < \n + 1.* The equations 


e = 01 


(382) 


Q = e *V\ Q = e ^V p \ e = ^e p \ e , (383) 

1 dcj) 

Q = Te 


© 


d(j) p 

ddr) 


= (--1 


0 


r -\ 


I© 


(384) 


may be regarded as particular cases of the general equation. The 
last equation is subject to the condition that n > 2. 

The last two equations give for a canonical ensemble, if n > 2, 



(385) 


The corresponding equations for a microcanonical ensemble give, if 
n > 2, 


(a - -\ — I = ^ 

\ n) €p e £p ' 6 d log V ’ 


(386) 


which shows that dtp d log V approaches the value unity when n is 
very great. 


See equation (292). 
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If a system consists of two parts, having separate energies, we 
may obtain equations similar in form to the preceding, which relate 
to the system as thus divided.* We shall distinguish quantities 
relating to the parts by letters with suffixes, the same letters without 
suffixes relating to the whole system. The extension-in-phase of 
the whole system within any given limits of the energies may be 
represented by the double integral 



taken within those limits, as appears at once from the definitions 
of Chapter VIII. In an ensemble distributed with uniform density 
within those limits, and zero density outside, the average value of 
any function of e\ and 62 is given by the quotient 


// 


udX 1 dX 2 


// 


dV l dV 2 


*If this condition is rigorously fulfilled, the parts will have no influence on 
each other, and the ensemble formed by distributing the whole microcanonically 
is too arbitrary a conception to have a real interest. The principal interest of 
the equations which we shall obtain will be in cases in which the condition is 
approximately fulfilled. But for the purposes of a theoretical discussion, it is 
of course convenient to make such a condition absolute. Compare Chapter IV, 
pp. 39 ff., where a similar condition is considered in connection with canonical 
ensembles. 
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which may also be written* 



4,1 dedV 2 


e 4,1 de dVo 


If we make the limits of integration e and e + de, we get the average 
value of u in an ensemble in which the whole system is microcanon- 
ically distributed in phase, viz., 

u\ e = e~+ [* u e 4 ' dV 2 , (387) 

Jv 2 =0 

where <p\ and V 2 are connected by the equation 

ei + £2 = constant = e, (388) 

and u, if given as function of ei, or of e\ and e 2 , becomes in virtue 
of the same equation a function of e 2 aloned 
Thus 


e"7i Vi I = e 4 

pe 2=e 

/ VM, 

(389) 


Jv 2 =0 


£ 

1 

-e- 

l 

= e~^V 2 \ . 

e le 

(390) 


* Where the analytical transformations are identical in form with those on 
the preceding pages, it does not appear necessary to give all the steps with the 
same detail. 

tin the applications of the equation (387), we cannot obtain all the results 
corresponding to those which we have obtained from equation (374), because 
4>p is a known function of e p , while (f> 1 must be treated as an arbitrary function 
of ei, or nearly so. 
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This requires a similar relation for canonical averages 


Again 


0 


= e~+V 


e 


dep i 
de i 


e _< hVi | Q = e~ r t >2 V2 | 0 • 

(391) 

■r‘^v.w 2 . 

Jv 2 =0 “ e i 

(392) 


But if n i > 2, e^ 1 vanishes for Vi = 0,* and 



cl r 2=t 

T ^ dV * 
de Jv 2 = o 


Hence, if rii > 2, and n 2 > 2, 


f£2=e 


' V2=0 


dtp i 
dei 


e^ 1 dH 2 . 


dep d(j)\ d(p 2 
de de± e de 2 


(393) 


(394) 


and _ 

1 dcj) 

0 de 

We have compared certain functions of the energy of the whole 
system with average values of similar functions of the kinetic energy 
of the whole system, and with average values of similar functions of 
the whole energy of a part of the system. We may also compare the 
same functions with average values of the kinetic energy of a part 
of the system. 

We shall express the total, kinetic, and potential energies of 
the whole system by e, e p , and e q , and the kinetic energies of the 
parts by ei p and e 2p . These kinetic energies are necessarily separate: 
we need not make any supposition concerning potential energies. 


dep i 

d<p2 

b de i 

0 de 2 


e 


(395) 


See Chapter VIII, equations (306) and (316). 
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The extension-in-phase within any limits which can be expressed in 
terms of e q , ei p , e 2p may be represented in the notations of Chap¬ 
ter Vlll by the triple integral 

Iff dV lp dV 2p dV q 

taken within those limits. And if an ensemble of systems is dis¬ 
tributed with a uniform density within those limits, the average 
value of any function of e q , ei p , e 2p will be expressed by the quotient 




u dVi p dV 2p dV q 
dVi p dV 2p dV q 


or 



^ lp de dV 2p dV q 



^ dedV 2p dV q 


To get the average value of u for a microcanonical distribution, we 
must make the limits e and e + de. The denominator in this case 
becomes de, and we have 




r^2p—€q 

' £2p=0 


ue^'» dV 2p dV q , 


where 0i p , V 2p , and V q are connected by the equation 


(396) 


ei p + e 2p + e q = constant = e. 


Accordingly 
e -<t>i P y lp = e “* 



f‘£2p—€—£q 

e 2p— 0 


V 1 p dV 2p dV q = e-*V, 


(397) 
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and we may write 


e~*V = e~^V lp \ ( = e-^V 2p 




(398) 


and 


0 = e =e ^V_ i p L = e L = — e ip | 0 = — e 2p | e . 


ni 


n 2 


Again, if n i > 2, 


(399) 


dcfrip 

deip 


= e~* r 6 [ €2P € eq d ^e^dV 2p dV q 


'V q =0 J €2p = 0 


= e 


'V,,=0 dtp 


d ^dV, = e-^ = f. (400) 
ae de 


Hence, if rii > 2, and n 2 > 2, 


d(j) d(f)lr 


de de 


i v 


d(j> 2p 
de 


2 v 


= (ini - l) e ip | e = (^2 - 1)6^ | £ , (401) 


1 d<p 

d(j)\p 

dej)2p 

0 de 

0 dt\ v 

0 d^2p 


= (ini - l) e i p | e = (\n 2 ~ l)e 


-1 


(402) 


We cannot apply the methods employed in the preceding pages 
to the microcanonical averages of the (generalized) forces Ai, A 2 , 
etc., exerted by a system on external bodies, since these quantities 
are not functions of the energies, either kinetic or potential, of the 
whole or any part of the system. We may however use the method 
described on page 136. 
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Let us imagine an ensemble of systems distributed in phase ac¬ 
cording to the index of probability 


c — 


(c 



) 


where e! is any constant which is a possible value of the energy, 
except only the least value which is consistent with the values of 
the external coordinates, and c and c o are other constants. We have 
therefore 

f a ii r ( e ~ e ') 2 

■ e c ~ * 2 dp!... dq n = 1, (403) 

J phases J 

0r / 2 

e~ c = [ - a - U - [ e~ ^ dpi... dq n , (404) 

J phases J 

or again 


From (404) we have 


F - e ') 2 , , 
+(p de. 


(405) 


>v =o 


de c 
da\ 


/ all 
phases 



h-O 2 


dpi...dq n 





+(t> de, 


(406) 


where A\ | e denotes the average value of A\ in those systems of the 
ensemble which have any same energy e. (This is the same thing as 
the average value of A\ in a microcanonical ensemble of energy e.) 
The validity of the transformation is evident, if we consider sepa¬ 
rately the part of each integral which lies between two infinitesimally 
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differing limits of energy. Integrating by parts, we get 


de c 
da i 





+<b 


J v=o 



fdA 1 
\ de 


+ Ai | 





+< ^ de. 


(407) 


Differentiating (405), we get 


de c 
dai 


!v =o da± 


de 




^ , (408) 


dai J y _q 


where e a denotes the least value of e consistent with the external 
coordinates. The last term in this equation represents the part of 
de~ c /da\ which is due to the variation of the lower limit of the in¬ 
tegral. It is evident that the expression in the brackets will vanish 
at the upper limit. At the lower limit, at which e p = 0, and e q has 
the least value consistent with the external coordinates, the average 
sign on A\\ t is superfluous, as there is but one value of A\ which is 
represented by —de a /dai. Exceptions may indeed occur for partic¬ 
ular values of the external coordinates, at which de a /da\ receive a 
finite increment, and the formula becomes illusory. Such particular 
values we may for the moment leave out of account. The last term 
of (408) is therefore equal to the first term of the second member 
of (407). (We may observe that both vanish when n > 2 on account 
of the factor eA) 

We have therefore from these equations 


’ v=o 


dAx\ e -j- d(p\ 

——^ + AiL— e v ~ 

de de J 


de = I —e 
v =o 




de, 


or 


<v =o 


gL4i I . deb deb \ c h~ e ') 2 , 6 , , 

— +Ai\ f -j- - - ) e oj 2 de = 0. 409 

de de dai J 
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That is: the average value in the ensemble of the quantity repre¬ 
sented by the principal parenthesis is zero. This must be true for 
any value of u. If we diminish oj, the average value of the parenthe¬ 
sis at the limit when oo vanishes becomes identical with the value 
for e = e '. But this may be any value of the energy, except the least 
possible. We have therefore 

dA\ | e dcj) d<fi 

de 1 6 de dai 

unless it be for the least value of the energy consistent with 
the external coordinates, or for particular values of the external 
coordinates. But the value of any term of this equation as de¬ 
termined for particular values of the energy and of the external 
coordinates is not distinguishable from its value as determined for 
values of the energy and external coordinates indefinitely near those 
particular values. The equation therefore holds without limitation. 
Multiplying by e^, we get 



e0 dA 1 \ e + = ^ d$_ = de^ = d 2 V 

de e de dai dai dai de 


(411) 


The integral of this equation is 


A 1 \ e e+ = ^ + F 1 , 

CLCi\ 


(412) 


where F\ is a function of the external coordinates. We have an 
equation of this form for each of the external coordinates. This 
gives, with (266), for the complete value of the differential of V 


dV = e*de+ (e*A l 


F\) da\ + {e^A 2 


F 2 )da 2 +etc., ( 413 ) 
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or 

dV = e^(de+Ai | £ dai+A 2 \ e da 2 +etc.)—F\ da\—F 2 da 2 — etc. (414) 

To determine the values of the functions F \, F 2 , etc., let us suppose 
ai, a 2l etc. to vary arbitrarily, while e varies so as always to have the 
least value consistent with the values of the external coordinates. 
This will make V = 0, and dV = 0. If n < 2, we shall have also 
= 0, which will give 


F\ =0, F 2 — 0, etc. (415) 

The result is the same for any value of n. For in the variations 
considered the kinetic energy will be constantly zero, and the po¬ 
tential energy will have the least value consistent with the external 
coordinates. The condition of the least possible potential energy 
may limit the ensemble at each instant to a single configuration, or 
it may not do so; but in any case the values of Ai, A 2 , etc. will be 
the same at each instant for all the systems of the ensemble,* and 
the equation 

da T A\ dcL\ T A 2 do -2 T etc. = 0 

will hold for the variations considered. Hence the functions F \, F 2 , 
etc. vanish in any case, and we have the equation 

dV = de + e^ A\ | £ da\ + A 2 | e da 2 + etc., (416) 

or _ 

e dai T A 2 ( da 2 T etc. 

e-W 

*This statement, as mentioned before, may have exceptions for particular 
values of the external coordinates. This will not invalidate the reasoning, which 
has to do with varying values of the external coordinates. 



dlogV = 


da T Ai 
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or again 

de = e~^VdlogV — A { | e da± — A 2 | e da 2 — etc. (418) 

It will be observed that the two last equations have the form of the 
fundamental differential equations of thermodynamics, eT°V corre¬ 
sponding to temperature and log V to entropy. We have already 
observed properties of e~^V suggestive of an analogy with temper¬ 
ature.* The significance of these facts will be discussed in another 
chapter. 

The two last equations might be written more simply 

de T A\ I dai -f- A 2 I da 2 -|- etc. 

dV = ---t— -, 

e-b 

de = e ^ dV — Ai | e da\ — A 2 | £ da 2 — etc., 

and still have the form analogous to the thermodynamic equations, 
but e~^ has nothing like the analogies with temperature which we 
have observed in e~^V. 


See Chapter IX, page 131; also this chapter, page 140. 




XI. 


MAXIMUM AND MINIMUM PROPERTIES OF VARIOUS 
DISTRIBUTIONS IN PHASE. 

In the following theorems we suppose, as always, that the systems 
forming an ensemble are identical in nature and in the values of the 
external coordinates, which are here regarded as constants. 

Theorem I. If an ensemble of systems is so distributed in phase 
that the index of probability is a function of the energy, the average 
value of the index is less than for any other distribution in which 
the distribution in energy is unaltered. 

Let us write rj for the index which is a function of the energy, and 
rj + Arj for any other which gives the same distribution in energy. 
It is to be proved that 

l ■ a - U - [(:n + Aq)e v+Ar] dj)\ ... dq n > [ • aU - [ qe v dp 1 . . . dq n , 

J phases J J phases J 

(419) 

where rj is a function of the energy, and Arj a function of the phase, 
which are subject to the conditions that 


/ all 

phases ( 


e v+Ari dpi . .. dq n = 


/ all 

phases < 


e v dpi... dq n = 1, (420) 


and that for any value of the energy (e') 


e=e'-\-de' 


e v+Ar] dpi... dq n = 


f*e=e'+de' 


e v dpi .. .dq n . (421) 


Equation (420) expresses the general relations which rj and q + Arj 
must satisfy in order to be indices of any distributions, and 
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(421) expresses the condition that they give the same distribution 
in energy. 

Since p is a function of the energy, and may therefore be re¬ 
garded as a constant within the limits of integration of (421), we 
may multiply by p under the integral sign in both members, which 
gives 



pe v+Ari dpi... dq n 


-e=E '+de' 


/ 


77 c 77 dpi... dq n . 


Since this is true within the limits indicated, and for every value 
of e', it will be true if the integrals are taken for all phases. We may 
therefore cancel the corresponding parts of (419), which gives 


f , aU . f A 77 e v+^ri _ ( jq n > Q 

J phases J 

But by (420) this is equivalent to 


(422) 


y(A?j e Arl + 1 — e Arl )e’ 1 dpi dq n > 0. (423) 

Now Ap e Ari + 1 — e Ar? is a decreasing function of Ap for negative 
values of Ap, and an increasing function of Ap for positive values 
of Ap. It vanishes for Ap = 0. The expression is therefore incapable 
of a negative value, and can have the value 0 only for Ap = 0. The 
inequality (423) will hold therefore unless Ap = 0 for all phases. 
The theorem is therefore proved. 

Theorem II. If an ensemble of systems is canonically distributed 
in phase, the average index of probability is less than in any other 
distribution of the ensemble having the same average energy. 

For the canonical distribution let the index be (ijj — e)/0, and 
for another having the same average energy let the index be 


/ all 
phases 
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(0 — e)/0 + Ar/, where Ar/ is an arbitrary function of the phase 
subject only to the limitation involved in the notion of the index, 
that 


/ all 
phases 


e © +A ' ? dpi... dq, n 


I all 

phases 


ip—e 

e~Q~ dpi... dq n 


and to that relating to the constant average energy, that 


1 , 

(424) 


/ all 
phases < 


e e © +An dpi ... dq n = 


/ all 
phases ( 


'ijj—e 

e e © dpi... dq n . 


(425) 


It is to be proved that 


/ all 
phases 


/ 




ib—e 


dpi ■ ■ ■ dq n 


> 


/ all 
phases 


/ 




'ip—e 

e © dpi... dq n . 


(426) 


Now in virtue of the first condition (424) we may cancel the constant 
term 0/0 in the parentheses in (426), and in virtue of the second 
condition (425) we may cancel the term e/0. The proposition to be 
proved is thus reduced to 

[ ■ aU - [ Ar/ e~o~ +Ar> dpi... dq n > 0, 

J phases J 

which may be written, in virtue of the condition (424), 

/(A, e Ar) + 1 — e A?? )e _ © _ dpi... dq n > 0. (427) 

In this form its truth is evident for the same reasons which applied 
to (423). 


/ all 
phases 
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Theorem. III. If 0 is any positive constant, the average value 
in an ensemble of the expression rj + e /0 (77 denoting as usual the 
index of probability and e the energy) is less when the ensemble is 
distributed canonically with modulus 0 , than for any other distri¬ 
bution whatever. 

In accordance with our usual notation let us write (if; — e) /@ for 
the index of the canonical distribution. In any other distribution 
let the index be (^ — e)/0 + A 77 . 

In the canonical ensemble 77 + e/0 has the constant value -0/0; 
in the other ensemble it has the value -0/0 + Arj. The proposition 
to be proved may therefore be written 

where 

[ • a - 1 - f e~&~ +An dpi... dq n — f — f e~&~ dp\ ... dq n = 1. 

J phases J J phases J 

(429) 

In virtue of this condition, since -0/0 is constant, the proposition 
to be proved reduces to 

0 < [ - aU - [ Ap e V+ Ar ? dpi... dq n , (430) 

J phases J 

where the demonstration may be concluded as in the last theorem. 

If we should substitute for the energy in the preceding theorems 
any other function of the phase, the theorems, mutatis mutandis, 
would still hold. On account of the unique importance of the energy 
as a function of the phase, the theorems as given are especially 
worthy of notice. When the case is such that other functions of the 
phase have important properties relating to statistical equilibrium, 
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as described in Chapter IV,* the three following theorems, which are 
generalizations of the preceding, may be useful. It will be sufficient 
to give them without demonstration, as the principles involved are 
in no respect different. 

Theorem IV. If an ensemble of systems is so distributed in phase 
that the index of probability is any function of F \, F 2 , etc., (these 
letters denoting functions of the phase,) the average value of the 
index is less than for any other distribution in phase in which the 
distribution with respect to the functions ip, F 2 , etc. is unchanged. 

Theorem V. If an ensemble of systems is so distributed in phase 
that the index of probability is a linear function of F \, F 2 , etc., 
(these letters denoting functions of the phase,) the average value 
of the index is less than for any other distribution in which the 
functions fq, F 2 , etc. have the same average values. 

Theorem VI. The average value in an ensemble of systems of 
ij + F (where ■// denotes as usual the index of probability and F any 
function of the phase) is less when the ensemble is so distributed 
that rj + F is constant than for any other distribution whatever. 

Theorem VII. If a system which in its different phases consti¬ 
tutes an ensemble consists of two parts, and we consider the average 
index of probability for the whole system, and also the average in¬ 
dices for each of the parts taken separately, the sum of the average 
indices for the parts will be either less than the average index for the 
whole system, or equal to it, but cannot be greater. The limiting 
case of equality occurs when the distribution in phase of each part 
is independent of that of the other, and only in this case. 

Let the coordinates and momenta of the whole system be 
qi, • • • In, Pi, ■■■Pn, of which q u ...q m , Pi, ■■■Pm relate to one 


See pages 41-46. 
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part of the system, and q m +i, ■ ■ ■ Qm Pm+ 1 , ■ ■ • p n to the other. If 
the index of probability for the whole system is denoted by r), the 
probability that the phase of an unspecified system lies within any 
given limits is expressed by the integral 


e v dpi . . . dq n 


(431) 


taken for those limits. If we set 



dp m +1 ■■■dp n dq m+ 1 ... dq n 


= e”\ 


(432) 


where the integrations cover all phases of the second system, and 


e v dp i... dp m dq 1 ... dq m = e m 


(433) 


where the integrations cover all phases of the first system, the inte¬ 
gral (431) will reduce to the form 


e m dpi... dp m dq 1 . . . dq r 


(434) 


when the limits can be expressed in terms of the coordinates and 
momenta of the first part of the system. The same integral will 
reduce to 

e m dp m+ i ... dp n dq m+ \ ... dq n , (435) 

when the limits can be expressed in terms of the coordinates and 
momenta of the second part of the system. It is evident that r/i and 
r] 2 are the indices of probability for the two parts of the system 
taken separately. 
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The main proposition to be proved may be written 


Vie m dpi... dq m + 


r] 2 e m dp m+ 1 ...dq n 


< 


r]e v dpi... dq n , (436) 


where the first integral is to be taken over all phases of the first 
part of the system, the second integral over all phases of the second 
part of the system, and the last integral over all phases of the whole 
system. Now we have 


J • • / e v dpi... dq n = 1, (437) 

J- ■ / e m dpi ... dq m = 1, (438) 


and 


e m dp m+ 1 ...dq n = 1, 


(439) 


where the limits cover in each case all the phases to which the 
variables relate. The two last equations, which are in themselves 
evident, may be derived by partial integration from the first. 

ft appears from the definitions of r]i and that (436) may also 
be written 


Vie v dpi... dq n + 


rj 2 e r] dpi . .. dq 


n 


< 


7]e v dpi... dq n , (440) 
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or 

(rj 'i'll - m)e v dpi... dq n > 0, 

where the integrations cover all phases. Adding the equation 

J...J e m+m dpi... dq n — 1, (441) 

which we get by multiplying (438) and (439), and subtracting (437), 
we have for the proposition to be proved 

J [(v ~ Vi ~ V 2 )d n + e m+1>2 — e v ] dpi... dq n > 0. (442) 

Let 

u = 7] ~r ]i - r] 2 . (443) 

The main proposition to be proved may be written 

/(««* +(444) 

This is evidently true since the quantity in the parenthesis is inca¬ 
pable of a negative value.* Moreover the sign = can hold only when 
the quantity in the parenthesis vanishes for all phases, i.e., when 
u — 0 for all phases. This makes r/ = rji + r] 2 for all phases, which 
is the analytical condition which expresses that the distributions in 
phase of the two parts of the system are independent. 

Theorem VIII. If two or more ensembles of systems which are 
identical in nature, but may be distributed differently in phase, are 
united to form a single ensemble, so that the probability-coefficient 


/ all 

phases 


/ all 
phases 



See Theorem I, where this is proved of a similar expression. 
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of the resulting ensemble is a linear function of the probability- 
coefficients of the original ensembles, the average index of proba¬ 
bility of the resulting ensemble cannot be greater than the same 
linear function of the average indices of the original ensembles. It 
can be equal to it only when the original ensembles are similarly 
distributed in phase. 

Let Pi, P‘ 2 , etc. be the probability-coefficients of the original en¬ 
sembles, and P that of the ensemble formed by combining them; 
and let N\ , 7V 2 , etc. be the numbers of systems in the original en¬ 
sembles. It is evident that we shall have 

P — C\P 1 + c 2 P 2 + etc. = ^ ] ( C\P \), (445) 

where 

Ni N 2 , s 

Cl = £lV C2= E^’ etc ' (446) 

The main proposition to be proved is that 

/ ■ aU - f P log P dpi... dq n < V c, / - aU - [ Pi log Pi dpi... dq n 

J phases J ' J phases J 

(447) 

or 

[ aU [ [V(ciPi log Pi) - Plogp] dpi... dq n > 0. (448) 

J phases J L ‘ * 

If we set 

Q i = Pi log Pi - Pi log P - Pi + P 

Q 1 will be positive, except when it vanishes for Pi = P. To prove 
this, we may regard Pi and P as any positive quantities. Then 

(S)) P = logPl “ logP ' 
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(d 2 QA = 1 

\dP?)p Pi- 

Since Q i and dQi/dPi vanish for P\ = P, and the second 
differential coefhcient is always positive, Q i must be positive except 
when Pi = P. Therefore, if Q 2 , etc. have similar definitions, 

( 449 ) 

But since 

5>iA) = p 

and 


= i, 

J>iQi) = ^(dPilogPi) -PlogP (450) 

This proves (448), and shows that the sign = will hold only when 

Pi = P, P 2 = P, etc. 

for all phases, i.e., only when the distribution in phase of the original 
ensembles are all identical. 

Theorem IX. A uniform distribution of a given number of sys¬ 
tems within given limits of phase gives a less average index of prob¬ 
ability of phase than any other distribution. 

Let r) be the constant index of the uniform distribution, and 
r] + Ar] the index of some other distribution. Since the number of 
systems within the given limits is the same in the two distributions 
we have 

J...J e ^d Pl ...dq n = f-f 


e" dpi... dq n . 


(451) 
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where the integrations, like those which follow, are to be taken 
within the given limits. The proposition to be proved may be writ¬ 
ten 


(rj + A r])e v+Ar> dpi . .. dq n > /• • ■ / r]e v dpi . .. dq n , (452) 


or, since rj is constant, 


(rj + Ap)e Ari dp x ...dq n > /• • ■ / 77 dpi... dq n . (453) 


In (451) also we may cancel the constant factor e 71 , and multiply by 
the constant factor (jj + 1). This gives 


(rj + l)e Av dpi... dq n = / • • • / (rj + 1) dp x ... dq r 


The subtraction of this equation will not alter the inequality to be 
proved, which may therefore be written 


(A?? — l)e Av dpi... dq„ 


> 


...dq n 


or 

(Ar]e Av — e Ari + 1) dpi ... dq n > 0. (454) 

Since the parenthesis in this expression represents a positive value, 
except when it vanishes for At] = 0, the integral will be positive un¬ 
less Ap vanishes everywhere within the limits, which would make 
the difference of the two distributions vanish. The theorem is there¬ 
fore proved. 



XII. 


ON THE MOTION OF SYSTEMS AND ENSEMBLES OF 
SYSTEMS THROUGH LONG PERIODS OF TIME. 

An important question which suggests itself in regard to any case 
of dynamical motion is whether the system considered will return 
in the course of time to its initial phase, or, if it will not return 
exactly to that phase, whether it will do so to any required degree 
of approximation in the course of a sufficiently long time. To be 
able to give even a partial answer to such questions, we must know 
something in regard to the dynamical nature of the system. In the 
following theorem, the only assumption in this respect is such as we 
have found necessary for the existence of the canonical distribution. 

If we imagine an ensemble of identical systems to be distributed 
with a uniform density throughout any finite extension-in-phase, 
the number of the systems which leave the extension-in-phase and 
will not return to it in the course of time is less than any assignable 
fraction of the whole number; provided, that the total extension-in- 
phase for the systems considered between two limiting values of the 
energy is finite, these limiting values being less and greater respec¬ 
tively than any of the energies of the first-mentioned extension-in- 
phase. 

To prove this, we observe that at the moment which we call ini¬ 
tial the systems occupy the given extension-in-phase. It is evident 
that some systems must leave the extension immediately, unless all 
remain in it forever. Those systems which leave the extension at 
the first instant, we shall call the front of the ensemble. It will 
be convenient to speak of this front as generating the extension-in- 
phase through which it passes in the course of time, as in geometry 
a surface is said to generate the volume through which it passes. In 
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equal times the front generates equal extensions in phase. This is an 
immediate consequence of the principle of conservation of extension- 
in-phase, unless indeed we prefer to consider it as a slight variation 
in the expression of that principle. For in two equal short intervals 
of time let the extensions generated be A and B. (We make the 
intervals short simply to avoid the complications in the enunciation 
or interpretation of the principle which would arise when the same 
extension-in-phase is generated more than once in the interval con¬ 
sidered.) Now if we imagine that at a given instant systems are 
distributed throughout the extension A, it is evident that the same 
systems will after a certain time occupy the extension B, which 
is therefore equal to A in virtue of the principle cited. The front 
of the ensemble, therefore, goes on generating equal extensions in 
equal times. But these extensions are included in a finite extension, 
viz., that bounded by certain limiting values of the energy. Sooner 
or later, therefore, the front must generate phases which it has be¬ 
fore generated. Such second generation of the same phases must 
commence with the initial phases. Therefore a portion at least of 
the front must return to the original extension-in-phase. The same 
is of course true of the portion of the ensemble which follows that 
portion of the front through the same phases at a later time. 

It remains to consider how large the portion of the ensemble is, 
which will return to the original extension-in-phase. There can be 
no portion of the given extension-in-phase, the systems of which 
leave the extension and do not return. For we can prove for any 
portion of the extension as for the whole, that at least a portion of 
the systems leaving it will return. 

We may divide the given extension-in-phase into parts as follows. 
There may be parts such that the systems within them will never 
pass out of them. These parts may indeed constitute the whole of 
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the given extension. But if the given extension is very small, these 
parts will in general be non-*existent. There may be parts such 
that systems within them will all pass out of the given extension 
and all return within it. The whole of the given extension-in-phase 
is made up of parts of these two kinds. This does not exclude the 
possibility of phases on the boundaries of such parts, such that 
systems starting with those phases would leave the extension and 
never return. But in the supposed distribution of an ensemble of 
systems with a uniform density-in-phase, such systems would not 
constitute any assignable fraction of the whole number. 

These distinctions may be illustrated by a very simple example. 
If we consider the motion of a rigid body of which one point is 
fixed, and which is subject to no forces, we find three cases. (1) The 
motion is periodic. (2) The system will never return to its original 
phase, but will return infinitely near to it. (3) The system will never 
return either exactly or approximately to its original phase. But if 
we consider any extension-in-phase, however small, a system leaving 
that extension will return to it except in the case called by Poinsot 
‘singular,’ viz., when the motion is a rotation about an axis lying in 
one of two planes having a fixed position relative to the rigid body. 
But all such phases do not constitute any true extension-in-phase 
in the sense in which we have defined and used the term.* 

* An ensemble of systems distributed in phase is a less simple and elementary 
conception than a single system. But by the consideration of suitable ensembles 
instead of single systems, we may get rid of the inconvenience of having to con¬ 
sider exceptions formed by particular cases of the integral equations of motion, 
these cases simply disappearing when the ensemble is substituted for the single 
system as a subject of study. This is especially true when the ensemble is dis¬ 
tributed, as in the case called canonical, throughout an extension-in-phase. In 
a less degree it is true of the microcanonical ensemble, which does not occupy 
any extension-in-phase, (in the sense in which we have used the term,) although 
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In the same way it may be proved that the systems in a canonical 
ensemble which at a given instant are contained within any finite 
extension-in-phase will in general return to that extension-in-phase, 
if they leave it, the exceptions, i.e., the number which pass out of 
the extension-in-phase and do not return to it, being less than any 
assignable fraction of the whole number. In other words, the prob¬ 
ability that a system taken at random from the part of a canonical 
ensemble which is contained within any given extension-in-phase, 
will pass out of that extension and not return to it, is zero. 

A similar theorem may be enunciated with respect to a micro- 
canonical ensemble. Let us consider the fractional part of such an 
ensemble which lies within any given limits of phase. This fraction 
we shall denote by F. It is evidently constant in time since the en¬ 
semble is in statistical equilibrium. The systems within the limits 
will not in general remain the same, but some will pass out in each 
unit of time while an equal number come in. Some may pass out 
never to return within the limits. But the number which in any time 
however long pass out of the limits never to return will not bear any 
finite ratio to the number within the limits at a given instant. For, 
if it were otherwise, let / denote the fraction representing such ratio 
for the time T. Then, in the time T, the number which pass out 
never to return will bear the ratio fF to the whole number in the 
ensemble, and in a time exceeding T / (fF) the number which pass 
out of the limits never to return would exceed the total number of 
systems in the ensemble. The proposition is therefore proved. 

This proof will apply to the cases before considered, and may be 
regarded as more simple than that which was given. It may also be 

it is convenient to regard it as a limiting case with respect to ensembles which 
do, as we thus gain for the subject some part of the analytical simplicity which 
belongs to the theory of ensembles which occupy true extensions-in-phase. 
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applied to any true case of statistical equilibrium. By a true case of 
statistical equilibrium is meant such as may be described by giving 
the general value of the probability that an unspecified system of 
the ensemble is contained within any given limits of phase.* 

*An ensemble in which the systems are material points constrained to move 
in vertical circles, with just enough energy to carry them to the highest points, 
cannot afford a true example of statistical equilibrium. For any other value of 
the energy than the critical value mentioned, we might in various ways describe 
an ensemble in statistical equilibrium, while the same language applied to the 
critical value of the energy would fail to do so. Thus, if we should say that the 
ensemble is so distributed that the probability that a system is in any given 
part of the circle is proportioned to the time which a single system spends in 
that part, motion in either direction being equally probable, we should perfectly 
define a distribution in statistical equilibrium for any value of the energy except 
the critical value mentioned above, but for this value of the energy all the 
probabilities in question would vanish unless the highest point is included in 
the part of the circle considered, in which case the probability is unity, or forms 
one of its limits, in which case the probability is indeterminate. Compare the 
foot-note on page 139. 

A still more simple example is afforded by the uniform motion of a material 
point in a straight line. Here the impossibility of statistical equilibrium is not 
limited to any particular energy, and the canonical distribution as well as the 
microcanonical is impossible. 

These examples are mentioned here in order to show the necessity of caution 
in the application of the above principle, with respect to the question whether 
we have to do with a true case of statistical equilibrium. 

Another point in respect to which caution must be exercised is that the part 
of an ensemble of which the theorem of the return of systems is asserted should 
be entirely defined by limits within which it is contained, and not by any such 
condition as that a certain function of phase shall have a given value. This is 
necessary in order that the part of the ensemble which is considered should be 
any assignable fraction of the whole. Thus, if we have a canonical ensemble 
consisting of material points in vertical circles, the theorem of the return of 
systems may be applied to a part of the ensemble defined as contained in a 
given part of the circle. But it may not be applied in all cases to a part of the 
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Let us next consider whether an ensemble of isolated systems 
has any tendency in the course of time toward a state of statistical 
equilibrium. 

There are certain functions of phase which are constant in time. 
The distribution of the ensemble with respect to the values of these 
functions is necessarily invariable, that is, the number of systems 
within any limits which can be specified in terms of these functions 
cannot vary in the course of time. The distribution in phase which 
without violating this condition gives the least value of the average 
index of probability of phase (fj) is unique, and is that in which the 
index of probability ( 77 ) is a function of the functions mentioned.* 
It is therefore a permanent distribution,' and the only permanent 
distribution consistent with the invariability of the distribution with 
respect to the functions of phase which are constant in time. 

It would seem, therefore, that we might find a sort of measure 
of the deviation of an ensemble from statistical equilibrium in the 
excess of the average index above the minimum which is consistent 
with the condition of the invariability of the distribution with re¬ 
spect to the constant functions of phase. But we have seen that the 
index of probability is constant in time for each system of the en¬ 
semble. The average index is therefore constant, and we find by this 
method no approach toward statistical equilibrium in the course of 
time. 

Yet we must here exercise great caution. One function may 
approach indefinitely near to another function, while some quan- 

ensemble defined as contained in a given part of the circle and having a given 
energy. It would, in fact, express the exact opposite of the truth when the given 
energy is the critical value mentioned above. 

*See Chapter XI, Theorem IV. 

^See Chapter IV, sub init. 
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tity determined by the first does not approach the corresponding 
quantity determined by the second. A line joining two points may 
approach indefinitely near to the straight line joining them, while 
its length remains constant. We may find a closer analogy with the 
case under consideration in the effect of stirring an incompressible 
liquid.* In space of 2 n dimensions the case might be made analyt¬ 
ically identical with that of an ensemble of systems of n degrees of 
freedom, but the analogy is perfect in ordinary space. Let us sup¬ 
pose the liquid to contain a certain amount of coloring matter which 
does not affect its hydrodynamic properties. Now the state in which 
the density of the coloring matter is uniform, i.e., the state of per¬ 
fect mixture, which is a sort of state of equilibrium in this respect 
that the distribution of the coloring matter in space is not affected 
by the internal motions of the liquid, is characterized by a minimum 
value of the average square of the density of the coloring matter. 
Let us suppose, however, that the coloring matter is distributed 
with a variable density. If we give the liquid any motion whatever, 
subject only to the hydrodynamic law of incompressibility,—it may 
be a steady flux, or it may vary with the time,—the density of the 
coloring matter at any same point of the liquid will be unchanged, 
and the average square of this density will therefore be unchanged. 
Yet no fact is more familiar to us than that stirring tends to bring 
a liquid to a state of uniform mixture, or uniform densities of its 
components, which is characterized by minimum values of the av¬ 
erage squares of these densities. It is quite true that in the physical 
experiment the result is hastened by the process of diffusion, but 
the result is evidently not dependent on that process. 

*By liquid is here meant the continuous body of theoretical hydrodynamics, 
and not anything of the molecular structure and molecular motions of real 
liquids. 
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The contradiction is to be traced to the notion of the density 
of the coloring matter, and the process by which this quantity is 
evaluated. This quantity is the limiting ratio of the quantity of 
the coloring matter in an element of space to the volume of that 
element. Now if we should take for our elements of volume, after 
any amount of stirring, the spaces occupied by the same portions of 
the liquid which originally occupied any given system of elements of 
volume, the densities of the coloring matter, thus estimated, would 
be identical with the original densities as determined by the given 
system of elements of volume. Moreover, if at the end of any finite 
amount of stirring we should take our elements of volume in any or¬ 
dinary form but sufficiently small, the average square of the density 
of the coloring matter, as determined by such element of volume, 
would approximate to any required degree to its value before the 
stirring. But if we take any element of space of fixed position and 
dimensions, we may continue the stirring so long that the densities 
of the colored liquid estimated for these fixed elements will approach 
a uniform limit, viz., that of perfect mixture. 

The case is evidently one of those in which the limit of a limit 
has different values, according to the order in which we apply the 
processes of taking a limit. If treating the elements of volume as 
constant, we continue the stirring indefinitely, we get a uniform 
density, a result not affected by making the elements as small as 
we choose; but if treating the amount of stirring as finite, we di¬ 
minish indefinitely the elements of volume, we get exactly the same 
distribution in density as before the stirring, a result which is not af¬ 
fected by continuing the stirring as long as we choose. The question 
is largely one of language and definition. One may perhaps be al¬ 
lowed to say that a finite amount of stirring will not affect the mean 
square of the density of the coloring matter, but an infinite amount 
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of stirring may be regarded as producing a condition in which the 
mean square of the density has its minimum value, and the density 
is uniform. We may certainly say that a sensibly uniform density of 
the colored component may be produced by stirring. Whether the 
time required for this result would be long or short depends upon 
the nature of the motion given to the liquid, and the fineness of our 
method of evaluating the density. 

All this may appear more distinctly if we consider a special case 
of liquid motion. Let us imagine a cylindrical mass of liquid of 
which one sector of 90° is black and the rest white. Let it have 
a motion of rotation about the axis of the cylinder in which the 
angular velocity is a function of the distance from the axis. In the 
course of time the black and the white parts would become drawn 
out into thin ribbons, which would be wound spirally about the axis. 
The thickness of these ribbons would diminish without limit, and 
the liquid would therefore tend toward a state of perfect mixture 
of the black and white portions. That is, in any given element 
of space, the proportion of the black and white would approach 
1 : 3 as a limit. Yet after any finite time, the total volume would be 
divided into two parts, one of which would consist of the white liquid 
exclusively, and the other of the black exclusively. If the coloring 
matter, instead of being distributed initially with a uniform density 
throughout a section of the cylinder, were distributed with a density 
represented by any arbitrary function of the cylindrical coordinates 
r, 9 and z, the effect of the same motion continued indefinitely would 
be an approach to a condition in which the density is a function of 
r and z alone. In this limiting condition, the average square of 
the density would be less than in the original condition, when the 
density was supposed to vary with 6 , although after any finite time 
the average square of the density would be the same as at first. 
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If we limit our attention to the motion in a single plane per¬ 
pendicular to the axis of the cylinder, we have something which is 
almost identical with a diagrammatic representation of the changes 
in distribution in phase of an ensemble of systems of one degree of 
freedom, in which the motion is periodic, the period varying with 
the energy, as in the case of a pendulum swinging in a circular 
arc. If the coordinates and momenta of the systems are represented 
by rectangular coordinates in the diagram, the points in the di¬ 
agram representing the changing phases of moving systems, will 
move about the origin in closed curves of constant energy. The mo¬ 
tion will be such that areas bounded by points representing moving 
systems will be preserved. The only difference between the motion 
of the liquid and the motion in the diagram is that in one case the 
paths are circular, and in the other they differ more or less from 
that form. 

When the energy is proportional to p 2 + q 2 the curves of constant 
energy are circles, and the period is independent of the energy. 
There is then no tendency toward a state of statistical equilibrium. 
The diagram turns about the origin without change of form. This 
corresponds to the case of liquid motion, when the liquid revolves 
with a uniform angular velocity like a rigid solid. 

The analogy between the motion of an ensemble of systems in 
an extension-in-phase and a steady current in an incompressible liq¬ 
uid, and the diagrammatic representation of the case of one degree 
of freedom, which appeals to our geometrical intuitions, may be 
sufficient to show how the conservation of density-in-phase, which 
involves the conservation of the average value of the index of proba¬ 
bility of phase, is consistent with an approach to a limiting condition 
in which that average value is less. We might perhaps fairly infer 
from such considerations as have been adduced that an approach 
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to a limiting condition of statistical equilibrium is the general rule, 
when the initial condition is not of that character. But the sub¬ 
ject is of such importance that it seems desirable to give it farther 
consideration. 

Let us suppose that the total extension-in-phase for the kind of 
system considered to be divided into equal elements (DV) which 
are very small but not infinitely small. Let us imagine an ensemble 
of systems distributed in this extension in a manner represented 
by the index of probability 77 , which is an arbitrary function of the 
phase subject only to the restriction expressed by equation (46) of 
Chapter I. We shall suppose the elements DV to be so small that 
r) may in general be regarded as sensibly constant within any one 
of them at the initial moment. Let the path of a system be defined 
as the series of phases through which it passes. 

At the initial moment (t') a certain system is in an element of 
extension DV'. Subsequently, at the time t", the same system is 
in the element DV". Other systems which were at first in DV' 
will at the time t" be in DV", but not all, probably. The systems 
which were at first in DV' will at the time t" occupy an extension-in- 
phase exactly as large as at first. But it will probably be distributed 
among a very great number of the elements (DV) into which we have 
divided the total extension-in-phase. If it is not so, we can generally 
take a later time at which it will be so. There will be exceptions to 
this for particular laws of motion, but we will confine ourselves to 
what may fairly be called the general case. Only a very small part 
of the systems initially in DV' will be found in DV" at the time t", 
and those which are found in DV" at that time were at the initial 
moment distributed among a very large number of elements DV. 

What is important for our purpose is the value of 77 , the index 
of probability of phase in the element DV" at the time t". In the 
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part of DV" occupied by systems which at the time t' were in DV' 
the value of 77 will be the same as its value in DV' at the time t', 
which we shall call rf. In the parts of DV" occupied by systems 
which at t' were in elements very near to DV' we may suppose the 
value of 77 to vary little from r(. We cannot assume this in regard 
to parts of DV" occupied by systems which at t' were in elements 
remote from DV'. We want, therefore, some idea of the nature of 
the extension-in-phase occupied at t' by the systems which at t" will 
occupy DV". Analytically, the problem is identical with finding the 
extension occupied at t" by the systems which at t' occupied DV'. 
Now the systems in DV" which lie on the same path as the system 
first considered, evidently arrived at DV" at nearly the same time, 
and must have left DV' at nearly the same time, and therefore at if 
were in or near DV'. We may therefore take rj as the value for these 
systems. The same essentially is true of systems in DV" which lie on 
paths very close to the path already considered. But with respect 
to paths passing through DV' and DV", but not so close to the 
first path, we cannot assume that the time required to pass from 
DV' to DV" is nearly the same as for the first path. The difference 
of the times required may be small in comparison with t" — t', but 
as this interval can be as large as we choose, the difference of the 
times required in the different paths has no limit to its possible 
value. Now if the case were one of statistical equilibrium, the value 
of 77 would be constant in any path, and if all the paths which 
pass through DV" also pass through or near DV', the value of 77 
throughout DV" will vary little from 7/. But when the case is not 
one of statistical equilibrium, we cannot draw any such conclusion. 
The only conclusion which we can draw with respect to the phase 
at t' of the systems which at t" are in DV" is that they are nearly 
on the same path. 
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Now if we should make a new estimate of indices of probability of 
phase at the time t", using for this purpose the elements DV ,—that 
is, if we should divide the number of systems in DV", for example, 
by the total number of systems, and also by the extension-in-phase 
of the element, and take the logarithm of the quotient, we would 
get a number which would be less than the average value of r/ for 
the systems within DV" based on the distribution in phase at the 
time t' * Hence the average value of r) for the whole ensemble of 
systems based on the distribution at t" will be less than the average 
value based on the distribution at t'. 

We must not forget that there are exceptions to this general rule. 
These exceptions are in cases in which the laws of motion are such 
that systems having small differences of phase will continue always 
to have small differences of phase. 

It is to be observed that if the average index of probability in 
an ensemble may be said in some sense to have a less value at one 
time than at another, it is not necessarily priority in time which 
determines the greater average index. If a distribution, which is not 
one of statistical equilibrium, should be given for a time t ', and the 
distribution at an earlier time t" should be defined as that given 
by the corresponding phases, if we increase the interval leaving t' 
fixed and taking t" at an earlier and earlier date, the distribution 
at t" will in general approach a limiting distribution which is in 
statistical equilibrium. The determining difference in such cases is 
that between a definite distribution at a definite time and the limit 
of a varying distribution when the moment considered is carried 
either forward or backward indefinitely. 

*See Chapter XI, Theorem IX. 

lOne may compare the kinematical truism that when two points are moving 
with uniform velocities, (with the single exception of the case where the relative 
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But while the distinction of prior and subsequent events may be 
immaterial with respect to mathematical fictions, it is quite other¬ 
wise with respect to the events of the real world. It should not be 
forgotten, when our ensembles are chosen to illustrate the proba¬ 
bilities of events in the real world, that while the probabilities of 
subsequent events may often be determined from the probabilities 
of prior events, it is rarely the case that probabilities of prior events 
can be determined from those of subsequent events, for we are rarely 
justified in excluding the consideration of the antecedent probability 
of the prior events. 

It is worthy of notice that to take a system at random from 
an ensemble at a date chosen at random from several given dates, 
t', t", etc., is practically the same thing as to take a system at 
random from the ensemble composed of all the systems of the given 
ensemble in their phases at the time t', together with the same 
systems in their phases at the time t", etc. By Theorem VIII of 
Chapter XI this will give an ensemble in which the average index 
of probability will be less than in the given ensemble, except in the 
case when the distribution in the given ensemble is the same at 
the times t 1 , t", etc. Consequently, any indefiniteness in the time 
in which we take a system at random from an ensemble has the 
practical effect of diminishing the average index of the ensemble 
from which the system may be supposed to be drawn, except when 
the given ensemble is in statistical equilibrium. 


motion is zero,) their mutual distance at any definite time is less than for t = oo, 
or t = —oo. 



XIII. 


EFFECT OF VARIOUS PROCESSES ON AN ENSEMBLE OF 

SYSTEMS. 

In the last chapter and in Chapter I we have considered the changes 
which take place in the course of time in an ensemble of isolated 
systems. Let us now proceed to consider the changes which will take 
place in an ensemble of systems under external influences. These ex¬ 
ternal influences will be of two kinds, the variation of the coordinates 
which we have called external, and the action of other ensembles of 
systems. The essential difference of the two kinds of influence con¬ 
sists in this, that the bodies to which the external coordinates relate 
are not distributed in phase, while in the case of interaction of the 
systems of two ensembles, we have to regard the fact that both are 
distributed in phase. To find the effect produced on the ensem¬ 
ble with which we are principally concerned, we have therefore to 
consider single values of what we have called external coordinates, 
but an infinity of values of the internal coordinates of any other 
ensemble with which there is interaction. 

Or,—to regard the subject from another point of view,—the ac¬ 
tion between an unspecified system of an ensemble and the bodies 
represented by the external coordinates, is the action between a sys¬ 
tem imperfectly determined with respect to phase and one which is 
perfectly determined; while the interaction between two unspecified 
systems belonging to different ensembles is the action between two 
systems both of which are imperfectly determined with respect to 
phase.* 

*In the development of the subject, we shall find that this distinction corre¬ 
sponds to the distinction in thermodynamics between mechanical and thermal 
action. 
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We shall suppose the ensembles which we consider to be dis¬ 
tributed in phase in the manner described in Chapter I, and 
represented by the notations of that chapter, especially by the 
index of probability of phase (rj). There are therefore 2 n inde¬ 
pendent variations in the phases which constitute the ensembles 
considered. This excludes ensembles like the nricrocanonical, in 
which, as energy is constant, there are only 2 n — 1 independent 
variations of phase. This seems necessary for the purposes of a 
general discussion. For although we may imagine a nricrocanon¬ 
ical ensemble to have a permanent existence when isolated from 
external influences, the effect of such influences would generally 
be to destroy the uniformity of energy in the ensemble. Moreover, 
since the nricrocanonical ensemble may be regarded as a limiting 
case of such ensembles as are described in Chapter I, (and that 
in more than one way, as shown in Chapter X,) the exclusion is 
rather formal than real, since any properties which belong to the 
nricrocanonical ensemble could easily be derived from those of the 
ensembles of Chapter I, which in a certain sense may be regarded 
as representing the general case. 

Let us first consider the effect of variation of the external 
coordinates. We have already had occasion to regard these quan¬ 
tities as variable in the differentiation of certain equations relating 
to ensembles distributed according to certain laws called canonical 
or nricrocanonical. That variation of the external coordinates was, 
however, only carrying the attention of the mind from an ensemble 
with certain values of the external coordinates, and distributed 
in phase according to some general law depending upon those 
values, to another ensemble with different values of the external 
coordinates, and with the distribution changed to conform to these 
new values. 
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What we have now to consider is the effect which would actually 
result in the course of time in an ensemble of systems in which 
the external coordinates should be varied in any arbitrary manner. 
Let us suppose, in the first place, that these coordinates are varied 
abruptly at a given instant, being constant both before and after 
that instant. By the definition of the external coordinates it appears 
that this variation does not affect the phase of any system of the 
ensemble at the time when it takes place. Therefore it does not 
affect the index of probability of phase ( 77 ) of any system, or the 
average value of the index (fj) at that time. And if these quantities 
are constant in time before the variation of the external coordinates, 
and after that variation, their constancy in time is not interrupted 
by that variation. In fact, in the demonstration of the conservation 
of probability of phase in Chapter I, the variation of the external 
coordinates was not excluded. 

But a variation of the external coordinates will in general disturb 
a previously existing state of statistical equilibrium. For, although 
it does not affect (at the first instant) the distribution-in-phase, it 
does affect the condition necessary for equilibrium. This condition, 
as we have seen in Chapter IV, is that the index of probability 
of phase shall be a function of phase which is constant in time 
for moving systems. Now a change in the external coordinates, by 
changing the forces which act on the systems, will change the nature 
of the functions of phase which are constant in time. Therefore, the 
distribution in phase which was one of statistical equilibrium for the 
old values of the external coordinates, will not be such for the new 
values. 

Now we have seen, in the last chapter, that when the distribu¬ 
tion-in-phase is not one of statistical equilibrium, an ensemble of 
systems may, and in general will, after a longer or shorter time, 
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come to a state which may be regarded, if very small differences of 
phase are neglected, as one of statistical equilibrium, and in which 
consequently the average value of the index (fj) is less than at first. 
It is evident, therefore, that a variation of the external coordinates, 
by disturbing a state of statistical equilibrium, may indirectly cause 
a diminution, (in a certain sense at least,) of the value of fj. 

But if the change in the external coordinates is very small, the 
change in the distribution necessary for equilibrium will in general 
be correspondingly small. Hence, the original distribution in phase, 
since it differs little from one which would be in statistical equi¬ 
librium with the new values of the external coordinates, may be 
supposed to have a value of fj which differs by a small quantity of 
the second order from the minimum value which characterizes the 
state of statistical equilibrium. And the diminution in the average 
index resulting in the course of time from the very small change in 
the external coordinates, cannot exceed this small quantity of the 
second order. 

Hence also, if the change in the external coordinates of an en¬ 
semble initially in statistical equilibrium consists in successive very 
small changes separated by very long intervals of time in which the 
disturbance of statistical equilibrium becomes sensibly effaced, the 
final diminution in the average index of probability will in general 
be negligible, although the total change in the external coordinates 
is large. The result will be the same if the change in the external 
coordinates takes place continuously but sufficiently slowly. 

Even in cases in which there is no tendency toward the restora¬ 
tion of statistical equilibrium in the lapse of time, a variation of 
external coordinates which would cause, if it took place in a short 
time, a great disturbance of a previous state of equilibrium, may, if 
sufficiently distributed in time, produce no sensible disturbance of 
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the statistical equilibrium. 

Thus, in the case of three degrees of freedom, let the systems be 
heavy points suspended by elastic massless cords, and let the en¬ 
semble be distributed in phase with a density proportioned to some 
function of the energy, and therefore in statistical equilibrium. For 
a change in the external coordinates, we may take a horizontal mo¬ 
tion of the point of suspension. If this is moved a given distance, the 
resulting disturbance of the statistical equilibrium may evidently be 
diminished indefinitely by diminishing the velocity of the point of 
suspension. This will be true if the law of elasticity of the string 
is such that the period of vibration is independent of the energy, 
in which case there is no tendency in the course of time toward a 
state of statistical equilibrium, as well as in the more general case, 
in which there is a tendency toward statistical equilibrium. 

That something of this kind will be true in general, the following 
considerations will tend to show. 

We define a path as the series of phases through which a system 
passes in the course of time when the external coordinates have 
fixed values. When the external coordinates are varied, paths are 
changed. The path of a phase is the path to which that phase 
belongs. With reference to any ensemble of systems we shall denote 
by D\ the average value of the density-in-phase in a path. This 
implies that we have a measure for comparing different portions 
of the path. We shall suppose the time required to traverse any 
portion of a path to be its measure for the purpose of determining 
this average. 

With this understanding, let us suppose that a certain ensemble 
is in statistical equilibrium. In every element of extension-in-phase, 
therefore, the density-in-phase D is equal to its path-average D | . 
Let a sudden small change be made in the external coordinates. 
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The statistical equilibrium will be disturbed and we shall no longer 
have D = D\ p everywhere. This is not because D is changed, but 
because D | is changed, the paths being changed. It is evident that 
if D > D | in a part of a path, we shall have D < D | in other 
parts of the same path. 

Now, if we should imagine a further change in the external 
coordinates of the same kind, we should expect it to produce an 
effect of the same kind. But the manner in which the second ef¬ 
fect will be superposed on the first will be different, according as 
it occurs immediately after the first change or after an interval of 
time. If it occurs immediately after the first change, then in any 
element of phase in which the first change produced a positive value 
of D — D | the second change will add a positive value to the first 
positive value, and where D — D | was negative, the second change 
will add a negative value to the first negative value. 

But if we wait a sufficient time before making the second change 
in the external coordinates, so that systems have passed from ele¬ 
ments of phase in which D — D | was originally positive to elements 
in which it was originally negative, and vice versa, (the systems 
carrying with them the values of D — D | ,) the positive values of 
D — D | caused by the second change will be in part superposed on 
negative values due to the first change, and vice versa. 

The disturbance of statistical equilibrium, therefore, produced 
by a given change in the values of the external coordinates may be 
very much diminished by dividing the change into two parts sep¬ 
arated by a sufficient interval of time, and a sufficient interval of 
time for this purpose is one in which the phases of the individual 
systems are entirely unlike the first, so that any individual system 
is differently affected by the change, although the whole ensem¬ 
ble is affected in nearly the same way. Since there is no limit to 
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the diminution of the disturbance of equilibrium by division of the 
change in the external coordinates, we may suppose as a general 
rule that by diminishing the velocity of the changes in the external 
coordinates, a given change may be made to produce a very small 
disturbance of statistical equilibrium. 

If we write rj' for the value of the average index of probability 
before the variation of the external coordinates, and rj" for the value 
after this variation, we shall have in any case 

T] < Tj 

as the simple result of the variation of the external coordinates. 
This may be compared with the thermodynamic theorem that the 
entropy of a body cannot be diminished by mechanical (as distin¬ 
guished from thermal) action.* 

If we have (approximate) statistical equilibrium between the 
times t' and t" (corresponding to r\ and rj"), we shall have approx¬ 
imately 

— / — // 

V = V , 

which may be compared with the thermodynamic theorem that the 
entropy of a body is not (sensibly) affected by mechanical action, 
during which the body is at each instant (sensibly) in a state of 
thermodynamic equilibrium. 

Approximate statistical equilibrium may usually be attained by 
a sufficiently slow variation of the external coordinates, just as ap¬ 
proximate thermodynamic equilibrium may usually be attained by 
sufficient slowness in the mechanical operations to which the body 
is subject. 

*The correspondences to which the reader’s attention is called are between 
—and entropy, and between 0 and temperature. 
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We now pass to the consideration of the effect on an ensem¬ 
ble of systems which is produced by the action of other ensembles 
with which it is brought into dynamical connection. In a previous 
chapter* we have imagined a dynamical connection arbitrarily cre¬ 
ated between the systems of two ensembles. We shall now regard 
the action between the systems of the two ensembles as a result of 
the variation of the external coordinates, which causes such varia¬ 
tions of the internal coordinates as to bring the systems of the two 
ensembles within the range of each other’s action. 

Initially, we suppose that we have two separate ensembles of sys¬ 
tems, Ei and E 2 . The numbers of degrees of freedom of the systems 
in the two ensembles will be denoted by ri\ and n 2 respectively, and 
the probability-coefficients by e m and e 712 . Now we may regard any 
system of the first ensemble combined with any system of the sec¬ 
ond as forming a single system of n\ + n 2 degrees of freedom. Let 
us consider the ensemble (Ei 2 ) obtained by thus combining each 
system of the first ensemble with each of the second. 

At the initial moment, which may be specified by a single accent, 
the probability-coefficient of any phase of the combined systems is 
evidently the product of the probability-coefficients of the phases of 
which it is made up. This may be expressed by the equation, 



e Vl2 = e ,? i e ,? L 

(455) 

or 


in = ii + i 2 . 

(456) 

which gives 


in = i\ + i 2 1 

(457) 


See Chapter IV, page 41. 
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The forces tending to vary the internal coordinates of the com¬ 
bined systems, together with those exerted by either system upon 
the bodies represented by the coordinates called external, may be 
derived from a single force-function, which, taken negatively, we 
shall call the potential energy of the combined systems and denote 
by € 12 ■ But we suppose that initially none of the systems of the two 
ensembles E 1 and E 2 come within range of each other’s action, so 
that the potential energy of the combined system falls into two parts 
relating separately to the systems which are combined. The same 
is obviously true of the kinetic energy of the combined compound 
system, and therefore of its total energy. This may be expressed by 
the equation 

e i2 = e i + (458) 

which gives 

^12 — ^1 + ^2- (459) 

Let us now suppose that in the course of time, owing to the 
motion of the bodies represented by the coordinates called external, 
the forces acting on the systems and consequently their positions 
are so altered, that the systems of the ensembles E\ and E 2 are 
brought within range of each other’s action, and after such mutual 
influence has lasted for a time, by a further change in the external 
coordinates, perhaps a return to their original values, the systems 
of the two original ensembles are brought again out of range of each 
other’s action. Finally, then, at a time specified by double accents, 
we shall have as at first 


— // _ -// , -// 

e 12 ~ e l + e 2‘ 


( 460 ) 
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But for the indices of probability we must write* 

v"+rfi<rf ; 2 . (461) 

The considerations adduced in the last chapter show that it is 
safe to write 

V12 — Vi-2- (462) 

We have therefore 

Vi + 4 i < v'l + r?2, ( 463 ) 

which may be compared with the thermodynamic theorem that the 
thermal contact of two bodies may increase but cannot diminish the 
sum of their entropies. 

Let us especially consider the case in which the two original 
ensembles were both canonically distributed in phase with the re¬ 
spective moduli @i and @ 2 . We have then, by Theorem 111 of Chap¬ 
ter XI, 



* + el ~ + ©T- 

(464) 


e' f" 

—1, t 2 ^ — // , t 2 

' h + el s 1,2 + el' 

(465) 

Whence with (463) 

we have 



e\ e' 2 e'{ e 2 

-1-<-1- 

©1 ©2 “ 01 ©2 

(466) 

or 




e" - e( e" - e' 

1 1 + 2 2 > 0. 

©1 ©2 - 

(467) 


See Chapter XI, Theorem VII. 
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If we write W for the average work done by the combined systems 
on the external bodies, we have by the principle of the conservation 
of energy 



(468) 


Now if W is negligible, we have 




(469) 


and (467) shows that the ensemble which has the greater modulus 
must lose energy. This result may be compared to the thermody¬ 
namic principle, that when two bodies of different temperatures are 
brought together, that which has the higher temperature will lose 
energy. 

Let us next suppose that the ensemble E 2 is originally canoni¬ 
cally distributed with the modulus 0o, but leave the distribution of 
the other arbitrary. We have, to determine the result of a similar 
process, 


Vi + V'l < Vi + ' { L 



Hence 



(470) 


which may be written 



(471) 


This may be compared with the thermodynamic principle that when 
a body (which need not be in thermal equilibrium) is brought into 
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thermal contact with another of a given temperature, the increase 
of entropy of the first cannot be less (algebraically) than the loss 
of heat by the second divided by its temperature. Where W is 
negligible, we may write 

*' + !<* + !. (472) 

Now, by Theorem 111 of Chapter XI, the quantity 

5?i + C (473) 

has a minimum value when the ensemble to which rj l and €\ relate 
is distributed canonically with the modulus © 2 - If the ensemble 
had originally this distribution, the sign < in (472) would be im¬ 
possible. In fact, in this case, it would be easy to show that the 
preceding formulae on which (472) is founded would all have the 
sign =. But when the two ensembles are not both originally dis¬ 
tributed canonically with the same modulus, the formulae indicate 
that the quantity (473) may be diminished by bringing the ensem¬ 
ble to which 6i and r)\ relate into connection with another which 
is canonically distributed with modulus @ 2 , and therefore, that by 
repeated operations of this kind the ensemble of which the original 
distribution was entirely arbitrary might be brought approximately 
into a state of canonical distribution with the modulus @ 2 . We 
may compare this with the thermodynamic principle that a body 
of which the original thermal state may be entirely arbitrary, may 
be brought approximately into a state of thermal equilibrium with 
any given temperature by repeated connections with other bodies 
of that temperature. 

Let us now suppose that we have a certain number of ensembles, 
E 0 , Ei, E- 2 , etc., distributed canonically with the respective moduli 
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@o, 0i, © 2 , etc. By variation of the external coordinates of the en¬ 
semble Eq. let it be brought into connection with E \, and then let 
the connection be broken. Let it then be brought into connection 
with E 2 , and then let that connection be broken. Let this process 
be continued with respect to the remaining ensembles. We do not 
make the assumption, as in some cases before, that the work con¬ 
nected with the variation of the external coordinates is a negligible 
quantity. On the contrary, we wish especially to consider the case 
in which it is large. In the final state of the ensemble Eq, let us 
suppose that the external coordinates have been brought back to 
their original values, and that the average energy (eo) is the same 
as at first. 

In our usual notations, using one and two accents to distinguish 
original and final values, we get by repeated applications of the 
principle expressed in (463) 


Vo + Vi + V 2 + etc. > Vo + Vi + V 2 + etc. (474) 


But by Theorem III of Chapter XI, 



(475) 


(476) 


(477) 


Hence 



( 478 ) 


STATISTICAL MECHANICS 


191 


or, since 


i - f" 
fc 0 — fc 0> 

f' _ f" f! _ f" 

0 > ^ + etc. (479) 

©1 ©2 

If we write W for the average work done on the bodies represented 
by the external coordinates, we have 

e[ - e" + d 2 - e" + etc. = W. (480) 

If Eq, Ei, and E 2 are the only ensembles, we have 

(48i) 

It will be observed that the relations expressed in the last three for¬ 
mulae between W, e\ — e ", e' 2 — 4, etc., and ©i, 0 2 , etc. are precisely 
those which hold in a Carnot’s cycle for the work obtained, the en¬ 
ergy lost by the several bodies which serve as heaters or coolers, 
and their initial temperatures. 

It will not escape the reader’s notice, that while from one point 
of view the operations which are here described are quite beyond 
our powers of actual performance, on account of the impossibility 
of handling the immense number of systems which are involved, yet 
from another point of view the operations described are the most 
simple and accurate means of representing what actually takes place 
in our simplest experiments in thermodynamics. The states of the 
bodies which we handle are certainly not known to us exactly. What 
we know about a body can generally be described most accurately 
and most simply by saying that it is one taken at random from a 
great number (ensemble) of bodies which are completely described. 
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If we bring it into connection with another body concerning which 
we have a similar limited knowledge, the state of the two bodies is 
properly described as that of a pair of bodies taken from a great 
number (ensemble) of pairs which are formed by combining each 
body of the first ensemble with each of the second. 

Again, when we bring one body into thermal contact with an¬ 
other, for example, in a Carnot’s cycle, when we bring a mass of 
fluid into thermal contact with some other body from which we wish 
it to receive heat, we may do it by moving the vessel containing the 
fluid. This motion is mathematically expressed by the variation of 
the coordinates which determine the position of the vessel. We al¬ 
low ourselves for the purposes of a theoretical discussion to suppose 
that the walls of this vessel are incapable of absorbing heat from 
the fluid. Yet while we exclude the kind of action which we call 
thermal between the fluid and the containing vessel, we allow the 
kind which we call work in the narrower sense, which takes place 
when the volume of the fluid is changed by the motion of a piston. 
This agrees with what we have supposed in regard to the external 
coordinates, which we may vary in any arbitrary manner, and are 
in this entirely unlike the coordinates of the second ensemble with 
which we bring the first into connection. 

When heat passes in any thermodynamic experiment between 
the fluid principally considered and some other body, it is actu¬ 
ally absorbed and given out by the walls of the vessel, which will 
retain a varying quantity. This is, however, a disturbing circum¬ 
stance, which we suppose in some way made negligible, and actu¬ 
ally neglect in a theoretical discussion. In our case, we suppose 
the walls incapable of absorbing energy, except through the motion 
of the external coordinates, but that they allow the systems which 
they contain to act directly on one another. Properties of this kind 
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are mathematically expressed by supposing that in the vicinity of 
a certain surface, the position of which is determined by certain 
(external) coordinates, particles belonging to the system in ques¬ 
tion experience a repulsion from the surface increasing so rapidly 
with nearness to the surface that an infinite expenditure of energy 
would be required to carry them through it. It is evident that two 
systems might be separated by a surface or surfaces exerting the 
proper forces, and yet approach each other closely enough to exert 
mechanical action on each other. 



XIV. 


DISCUSSION OF THERMODYNAMIC ANALOGIES. 

If we wish to find in rational mechanics an a priori foundation for 
the principles of thermodynamics, we must seek mechanical defini¬ 
tions of temperature and entropy. The quantities thus defined must 
satisfy (under conditions and with limitations which again must be 
specified in the language of mechanics) the differential equation 

de — T dp — A± dai — A 2 da 2 — etc., (482) 

where e, T, and 77 denote the energy, temperature, and entropy of 
the system considered, and Aida±, etc., the mechanical work (in 
the narrower sense in which the term is used in thermodynamics, 
i.e., with exclusion of thermal action) done upon external bodies. 

This implies that we are able to distinguish in mechanical terms 
the thermal action of one system on another from that which we 
call mechanical in the narrower sense, if not indeed in every case in 
which the two may be combined, at least so as to specify cases of 
thermal action and cases of mechanical action. 

Such a differential equation moreover implies a finite equation 
between e, 77, and aq, a 2 , etc., which may be regarded as funda¬ 
mental in regard to those properties of the system which we call 
thermodynamic, or which may be called so from analogy. This 
fundamental thermodynamic equation is determined by the fun¬ 
damental mechanical equation which expresses the energy of the 
system as function of its momenta and coordinates with those ex¬ 
ternal coordinates ( 01 , a 2 , etc.) which appear in the differential ex¬ 
pression of the work done on external bodies. We have to show the 
mathematical operations by which the fundamental thermodynamic 
equation, which in general is an equation of few variables, is derived 
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from the fundamental mechanical equation, which in the case of the 
bodies of nature is one of an enormous number of variables. 

We have also to enunciate in mechanical terms, and to prove, 
what we call the tendency of heat to pass from a system of higher 
temperature to one of lower, and to show that this tendency vanishes 
with respect to systems of the same temperature. 

At least, we have to show by a priori reasoning that for such 
systems as the material bodies which nature presents to us, these 
relations hold with such approximation that they are sensibly true 
for human faculties of observation. This indeed is all that is re¬ 
ally necessary to establish the science of thermodynamics on an 
a priori basis. Yet we will naturally desire to find the exact ex¬ 
pression of those principles of which the laws of thermodynamics 
are the approximate expression. A very little study of the statisti¬ 
cal properties of conservative systems of a finite number of degrees 
of freedom is sufficient to make it appear, more or less distinctly, 
that the general laws of thermodynamics are the limit toward which 
the exact laws of such systems approximate, when their number of 
degrees of freedom is indefinitely increased. And the problem of 
finding the exact relations, as distinguished from the approximate, 
for systems of a great number of degrees of freedom, is practically 
the same as that of finding the relations which hold for any number 
of degrees of freedom, as distinguished from those which have been 
established on an empirical basis for systems of a great number of 
degrees of freedom. 

The enunciation and proof of these exact laws, for systems of 
any finite number of degrees of freedom, has been a principal object 
of the preceding discussion. But it should be distinctly stated that, 
if the results obtained when the numbers of degrees of freedom are 
enormous coincide sensibly with the general laws of thcrmodynam- 
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ics, however interesting and significant this coincidence may be, we 
are still far from having explained the phenomena of nature with 
respect to these laws. For, as compared with the case of nature, 
the systems which we have considered are of an ideal simplicity. 
Although our only assumption is that we are considering conserva¬ 
tive systems of a finite number of degrees of freedom, it would seem 
that this is assuming far too much, so far as the bodies of nature 
are concerned. The phenomena of radiant heat, which certainly 
should not be neglected in any complete system of thermodynam¬ 
ics, and the electrical phenomena associated with the combination 
of atoms, seem to show that the hypothesis of systems of a finite 
number of degrees of freedom is inadequate for the explanation of 
the properties of bodies. 

Nor do the results of such assumptions in every detail appear 
to agree with experience. We should expect, for example, that a 
diatomic gas, so far as it could be treated independently of the 
phenomena of radiation, or of any sort of electrical manifestations, 
would have six degrees of freedom for each molecule. But the be¬ 
havior of such a gas seems to indicate not more than five. 

But although these difficulties, long recognized by physicists,* 
seem to prevent, in the present state of science, any satisfactory 
explanation of the phenomena of thermodynamics as presented to 
us in nature, the ideal case of systems of a finite number of degrees 
of freedom remains as a subject which is certainly not devoid of a 
theoretical interest, and which may serve to point the way to the 
solution of the far more difficult problems presented to us by nature. 
And if the study of the statistical properties of such systems gives 
us an exact expression of laws which in the limiting case take the 
form of the received laws of thermodynamics, its interest is so much 

*See Boltzmann, Sitzb. der Wiener Akad., Bd. LXIII, S. 418, (1871). 
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the greater. 

Now we have defined what we have called the modulus (0) of 
an ensemble of systems canonically distributed in phase, and what 
we have called the index of probability (rj) of any phase in such an 
ensemble. It has been shown that between the modulus (0), the 
external coordinates (cq, etc.), and the average values in the ensem¬ 
ble of the energy (e), the index of probability (rj), and the external 
forces (Ai, etc.) exerted by the systems, the following differential 
equation will hold: 

de = —0 drj — A\ da\ — A 2 da 2 — etc. (483) 

This equation, if we neglect the sign of averages, is identical in form 
with the thermodynamic equation (482), the modulus (0) corre¬ 
sponding to temperature, and the index of probability of phase with 
its sign reversed corresponding to entropy.* 

We have also shown that the average square of the anomalies of e, 
that is, of the deviations of the individual values from the average, 
is in general of the same order of magnitude as the reciprocal of the 
number of degrees of freedom, and therefore to human observation 
the individual values are indistinguishable from the average values 
when the number of degrees of freedom is very great7 In this case 
also the anomalies of r] are practically insensible. The same is true 
of the anomalies of the external forces (A\, etc.), so far as these 
are the result of the anomalies of energy, so that when these forces 
are sensibly determined by the energy and the external coordinates, 
and the number of degrees of freedom is very great, the anomalies 
of these forces are insensible. 


*See Chapter IV, pages 49, 51. 
i’See Chapter VII, pages 85-87. 
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The mathematical operations by which the finite equation be¬ 
tween e, rj, and cq, etc., is deduced from that which gives the 
energy (e) of a system in terms of the momenta (p±, ...p n ) and 
coordinates both internal (qi, ...q n ) and external (ai, etc.), are 
indicated by the equation 


_± 

e © = 


/ all 
phases 


€_ 

e~ © dqi . .. dq n dp\ ... dp n , 


(484) 


where 

-0 = Qfj + e. 

We have also shown that when systems of different ensembles 
are brought into conditions analogous to thermal contact, the aver¬ 
age result is a passage of energy from the ensemble of the greater 
modulus to that of the less,* * or in case of equal moduli, that we have 
a condition of statistical equilibrium in regard to the distribution 
of energy.* 

Propositions have also been demonstrated analogous to those in 
thermodynamics relating to a Carnot’s cycle,* or to the tendency of 
entropy to increase,^ especially when bodies of different temperature 
are brought into contact.^ 

We have thus precisely defined quantities, and rigorously demon¬ 
strated propositions, which hold for any number of degrees of free¬ 
dom, and which, when the number of degrees of freedom (n) is 
enormously great, would appear to human faculties as the quanti¬ 
ties and propositions of empirical thermodynamics. 

*See Chapter XIII, page 187. 

*See Chapter IV, pages 39-41. 

*See Chapter XIII, pages 189, 191. 

§ See Chapter XII, pages 168-177. 

^See Chapter XIII, page 186. 
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It is evident, however, that there may be more than one quantity 
defined for finite values of n, which approach the same limit, when 
n is increased indefinitely, and more than one proposition relating to 
finite values of n, which approach the same limiting form for n = oo. 
There may be therefore, and there are, other quantities which may 
be thought to have some claim to be regarded as temperature and 
entropy with respect to systems of a finite number of degrees of 
freedom. 

The definitions and propositions which we have been considering 
relate essentially to what we have called a canonical ensemble of 
systems. This may appear a less natural and simple conception 
than what we have called a microcanonical ensemble of systems, in 
which all have the same energy, and which in many cases represents 
simply the time-ensemble, or ensemble of phases through which a 
single system passes in the course of time. 

It may therefore seem desirable to find definitions and proposi¬ 
tions relating to these microcanonical ensembles, which shall corre¬ 
spond to what in thermodynamics are based on experience. Now 
the differential equation 

de = e _9 V dlog V — A\ | e da\ — A 2 | e da 2 — etc., (485) 

which has been demonstrated in Chapter X, and which relates to 
a microcanonical ensemble, A] | e denoting the average value of A± 
in such an ensemble, corresponds precisely to the thermodynamic 
equation, except for the sign of average applied to the external 
forces. But as these forces are not entirely determined by the energy 
with the external coordinates, the use of average values is entirely 
germane to the subject, and affords the readiest means of getting 
perfectly determined quantities. These averages, which are taken 
for a microcanonical ensemble, may seem from some points of view 
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a more simple and natural conception than those which relate to 
a canonical ensemble. Moreover, the energy, and the quantity cor¬ 
responding to entropy, are free from the sign of average in this 
equation. 

The quantity in the equation which corresponds to entropy 
is logy, the quantity V being defined as the extension-in-phase 
within which the energy is less than a certain limiting value (e). 
This is certainly a more simple conception than the average value in 
a canonical ensemble of the index of probability of phase. Log V has 
the property that when it is constant 

de = — A\ | e dai — A 2 | e da 2 + etc., (486) 

which closely corresponds to the thermodynamic property of en¬ 
tropy, that when it is constant 

de = —A] da i — A 2 da 2 + etc. (487) 

The quantity in the equation which corresponds to temperature is 
e~^V, or de/dlogV. In a canonical ensemble, the average value of 
this quantity is equal to the modulus, as has been shown by different 
methods in Chapters IX and X. 

In Chapter X it has also been shown that if the systems of 
a microcanonical ensemble consist of parts with separate energies, 
the average value of e~^V or any part is equal to its average value 
for any other part, and to the uniform value of the same expression 
for the whole ensemble. This corresponds to the theorem in the 
theory of heat that in case of thermal equilibrium the temperatures 
of the parts of a body are equal to one another and to that of the 
whole body. Since the energies of the parts of a body cannot be 
supposed to remain absolutely constant, even where this is the case 
with respect to the whole body, it is evident that if we regard the 
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temperature as a function of the energy, the taking of average or 
of probable values, or some other statistical process, must be used 
with reference to the parts, in order to get a perfectly definite value 
corresponding to the notion of temperature. 

It is worthy of notice in this connection that the average value 
of the kinetic energy, either in a microcanonical ensemble, or in a 
canonical, divided by one half the number of degrees of freedom, is 
equal to e~^V, or to its average value, and that this is true not only 
of the whole system which is distributed either microcanonically 
or canonically, but also of any part, although the corresponding 
theorem relating to temperature hardly belongs to empirical ther¬ 
modynamics, since neither the (inner) kinetic energy of a body, nor 
its number of degrees of freedom is immediately cognizable to our 
faculties, and we meet the gravest difficulties when we endeavor to 
apply the theorem to the theory of gases, except in the simplest 
case, that of the gases known as monatomic. 

But the correspondence between e~^V or de/d log Id and tem¬ 
perature is imperfect. If two isolated systems have such energies 
that 

de i de 2 

d log V\ d log V 2 ’ 

and the two systems are regarded as combined to form a third sys¬ 
tem with energy 

= Cl + 62, 

we shall not have in general 

de 12 de 1 de 2 

d log V 12 d log Vi d log V 2 ’ 

as analogy with temperature would require. In fact, we have seen 
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that _ _ 

dd \2 d(\ de 2 

d log Vj 2 d log Vi ei2 dlogC 2 ei2 ’ 

where the second and third members of the equation denote average 
values in an ensemble in which the compound system is microcanon- 
ically distributed in phase. Let us suppose the two original systems 
to be identical in nature. Then 


ei - e 2 - ei| ei2 - e 2 j ei2 . 

The equation in question would require that 

de i dt\ 

d log V, d log V, ei2 ’ 

i.e., that we get the same result, whether we take the value of 
de i / d: log Vi determined for the average value of e\ in the ensemble, 
or take the average value of de\/d log V\. This will be the case 
where dei /dlogVi is a linear function of e x . Evidently this does 
not constitute the most general case. Therefore the equation in 
question cannot be true in general. It is true, however, in some 
very important particular cases, as when the energy is a quadratic 
function of the p’s and g’s, or of the p’s alone.* When the equation 
holds, the case is analogous to that of bodies in thermodynamics 
for which the specific heat for constant volume is constant. 

Another quantity which is closely related to temperature is 
defr/de. It has been shown in Chapter IX that in a canonical en¬ 
semble, if n > 2, the average value of dcp/de is 1/0, and that the 

*This last case is important on account of its relation to the theory of gases, 
although it must in strictness be regarded as a limit of possible cases, rather 
than as a case which is itself possible. 
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most common value of the energy in the ensemble is that for which 
d(j)/de =1/0. The first of these properties may be compared with 
that of de/d log V, which has been seen to have the average value 0 
in a canonical ensemble, without restriction in regard to the number 
of degrees of freedom. 

With respect to microcanonical ensembles also, d(f)/de has a 
property similar to what has been mentioned with respect to 
de/d log Id. That is, if a system microcanonically distributed in 
phase consists of two parts with separate energies, and each with 
more than two degrees of freedom, the average values in the ensem¬ 
ble of d(f>/de for the two parts are equal to one another and to the 
value of same expression for the whole. In our usual notations 


d(j) i 
de i 


£12 



d</> 12 
de 12 


if n i > 2, and n 2 > 2. 

This analogy with temperature has the same incompleteness 
which was noticed with respect to de/dlogV, viz., if two systems 
have such energies (ei and e 2 ) that 


dc/i d(f> 2 
de\ de 2 

and they are combined to form a third system with energy 


^i2 — + e 2 , 


we shall not have in general 

dcf) 12 dxpi d(p2 
de 12 de\ de 2 
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Thus, if the energy is a quadratic function of the p’s and q’s, we 
have* 

dcj) i ri\ — 1 d(f> 2 ri 2 — 1 

rfei ei ’ de 2 e 2 
d(j) 12 _ 7ii2 - 1 _ ni + n 2 — 1 
de 12 ei 2 + ^2 

where 77i, 772, 7712 , are the numbers of degrees of freedom of the 
separate and combined systems. But 

dcj) 1 d(f>2 '77 1 + 77 2 — 2 

dei de 2 ei + e 2 

If the energy is a quadratic function of the p’s alone, the case would 
be the same except that we should have |t7 15 \n 2 , \n\ 2 , instead of 
77i, 77 2 , 77 12 . In these particular cases, the analogy between de/d log V 
and temperature would be complete, as has already been remarked. 
We should have 

de i 6i de 2 e 2 

d log Vi 77 1 ’ d log V 2 n 2 ’ 

de 12 _ e 12 _ dei _ de 2 

d logVi2 7712 d log V\ d log V 2 ’ 

when the energy is a quadratic function of the p’s and g’s, and 
similar equations with \n 2 , |t7 12 , instead of 77 l5 n 2 , 77 12 , when 
the energy is a quadratic function of the p’s alone. 

More characteristic of d<fi/de are its properties relating to most 
probable values of energy. If a system having two parts with sep¬ 
arate energies and each with more than two degrees of freedom is 

*See foot-note on page 109. We have here made the least value of the energy 
consistent with the values of the external coordinates zero instead of e a , as is 
evidently allowable when the external coordinates are supposed invariable. 
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microcanonically distributed in phase, the most probable division 
of energy between the parts, in a system taken at random from the 
ensemble, satisfies the equation 

d(f) i _ dfo 
dd\ da 2 

which corresponds to the thermodynamic theorem that the distri¬ 
bution of energy between the parts of a system, in case of thermal 
equilibrium, is such that the temperatures of the parts are equal. 

To prove the theorem, we observe that the fractional part of the 
whole number of systems which have the energy of one part (ei) 
between the limits e\ and e" is expressed by 

e ~tl2 / e 4>l+<p2 deu 

Jt'l 

where the variables are connected by the equation 



ei + £2 = constant = ei 2 - 

The greatest value of this expression, for a constant infinitesimal 
value of the difference e" — e \, determines a value of e 1 , which we 
may call its most probable value. This depends on the greatest 
possible value of 0i + 02- Now if n\ > 2, and n 2 > 2, we shall 
have 0i = —oo for the least possible value of ei, and 02 = — oo 
for the least possible value of e 2 . Between these limits 0i and 0 2 
will be finite and continuous. Hence 0i + 0 2 will have a maximum 
satisfying the equation (488). 

But if rti < 2, or ri 2 < 2, d(f)i/dei or d0 2 / 2 e 2 may be negative, or 
zero, for all values of ei or 62 , and can hardly be regarded as having 
properties analogous to temperature. 
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It is also worthy of notice that if a system which is microcanon- 
ically distributed in phase has three parts with separate energies, 
and each with more than two degrees of freedom, the most probable 
division of energy between these parts satisfies the equation 

dcj) i d(f) 2 dcj) 3 

de i de 2 de 3 

That is, this equation gives the most probable set of values of e 1, e 2 , 
and e 3 . But it does not give the most probable value of ei, or of e 2 , or 
of e 3 . Thus, if the energies are quadratic functions of the p’s and g’s, 
the most probable division of energy is given by the equation 

ni — 1 n 2 — 1 n 3 — 1 
Cl C2 e 3 

But the most probable value of is given by 

ni - 1 _ n 2 + n 3 - 1 
£i e 2 + e 3 

while the preceding equations give 

ra 1 — 1 _ n 2 + n 3 - 2 
6i e 2 + e 3 

These distinctions vanish for very great values of n \, n 2 , n 3 . For 
small values of these numbers, they are important. Such facts seem 
to indicate that the consideration of the most probable division of 
energy among the parts of a system does not afford a convenient 
foundation for the study of thermodynamic analogies in the case of 
systems of a small number of degrees of freedom. The fact that a 
certain division of energy is the most probable has really no especial 
physical importance, except when the ensemble of possible divisions 
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are grouped so closely together that the most probable division may 
fairly represent the whole. This is in general the case, to a very close 
approximation, when n is enormously great; it entirely fails when 
n is small. 

If we regard d(j)/de as corresponding to the reciprocal of temper¬ 
ature, or, in other words, de/dcj) as corresponding to temperature, 
0 will correspond to entropy. It has been defined as log jdV/de). In 
the considerations on which its definition is founded, it is therefore 
very similar to log V. We have seen that dcj)/d log V approaches the 
value unity when n is very great.* 

To form a differential equation on the model of the thermody¬ 
namic equation (482), in which de/dcj) shall take the place of tem¬ 
perature, and 0 of entropy, we may write 

de = dcj)+(-^~da 2 + e tc., (489) 

\d(j)J a \daij^ a \da 2 J^ a 

or 

d(f> = -f- de + dai + - 7 ^- da 2 + etc. (490) 

dc d&\ d&2 

With respect to the differential coefficients in the last equation, 
which corresponds exactly to (482) solved with respect to dr), we 
have seen that their average values in a canonical ensemble are 
equal to 1/0, and the averages of A\/Q, A 2 /Q, etc/ We have also 
seen that de/dcj) (or dcj>/de) has relations to the most probable values 
of energy in parts of a microcanonical ensemble. That ( de/dai )^ a , 
etc., have properties somewhat analogous, may be shown as follows. 

In a physical experiment, we measure a force by balancing it 
against another. If we should ask what force applied to increase or 

*See Chapter X, pages 141, 142. 

i'See Chapter IX, equations (321), (327). 
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diminish cq would balance the action of the systems, it would be 
one which varies with the different systems. But we may ask what 
single force will make a given value of cq the most probable, and we 
shall find that under certain conditions (de/dai)^^ represents that 
force. 

To make the problem definite, let us consider a system con¬ 
sisting of the original system together with another having the 
coordinates cq, a 2 , etc., and forces A \, A' 2 , etc., tending to in¬ 
crease those coordinates. These are in addition to the forces A\, 
A 2 , etc., exerted by the original system, and are derived from a 
force-function (— e' q ) by the equations 


A 


de' q 
(I a \ 


A ' 0 



etc. 


For the energy of the whole system we may write 


E = e + e’ q + \m\d\ + \m 2 d \ + etc., 
and for the extension-in-phase of the whole system within any limits 


dpi... dq n da\ rn 1 ddi da 2 m 2 dci 2 ..., 


or 



de da 1 rn 1 da 1 da 2 m 2 dd 2 




or again 



dE da\ mi da 1 da 2 m 2 dd 2 .. 


5 


since de = dE , when cq, hi, a 2 , a 2 , etc., are constant. If the limits 
are expressed by E and E + dE , <q and ai + da j, di and hi + da 1, 
etc., the integral reduces to 


e^ dE da\ mi da 1 da 2 m 2 da 2 .... 
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The values of oq, di, a 2 , d 2 , etc., which make this expression a 
maximum for constant values of the energy of the whole system 
and of the differentials dE, da±, ddi, etc., are what may be called 
the most probable values of ai, di, etc., in an ensemble in which the 
whole system is distributed microcanonically. 

To determine these values we have 

de^ = 0, 


when 
That is, 
when 


d(e + e' q + \m\a\ + ^m 2 a 2 2 + etc.) = 0. 
d<p = 0, 


( — ) d(f> + ( —— | da i — A\ dai + etc. + m idi da, i + etc. = 0 . 
\d<t>J a \ da iJ<p,a 


This requires 
and 


di = 0, a 2 = 0, etc. 


( de 


\dai 


( de 


^ = 4 , ^ = 4 . 


(j),a 


\dCL 2 


<j),a 


This shows that for any given values of E, Gp, a 2 , etc. ( —— 

\ da W 4>,a 

-) , etc., represent the forces (in the generalized sense) which 

d(, Jo.n 

the external bodies would have to exert to make these values of a-i, 
a 2 , etc., the most probable under the conditions specified. When the 
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differences of the external forces which are exerted by the different 
systems are negligible, —(de/dai)^ a etc., represent these forces. 

It is certainly in the quantities relating to a canonical ensemble, 
e, 0, rj, Ai, etc., cq, etc., that we find the most complete correspon¬ 
dence with the quantities of the thermodynamic equation ( 482 ). 
Yet the conception itself of the canonical ensemble may seem to 
some artificial, and hardly germane to a natural exposition of the 

subject; and the quantities e, ———, log 14 , A\\. etc., aq, etc., 

alogT 


or e, — ,</>, ( -— , etc., aq, etc., which are closely related to 

d( P \ da i/^a 

ensembles of constant energy, and to average and most probable 
values in such ensembles, and most of which are defined without 
reference to any ensemble, may appear the most natural analogues 
of the thermodynamic quantities. 

In regard to the naturalness of seeking analogies with the ther¬ 
modynamic behavior of bodies in canonical or microcanonical en¬ 
sembles of systems, much will depend upon how we approach the 
subject, especially upon the question whether we regard energy or 
temperature as an independent variable. 

It is very natural to take energy for an independent variable 
rather than temperature, because ordinary mechanics furnishes us 
with a perfectly defined conception of energy, whereas the idea of 
something relating to a mechanical system and corresponding to 
temperature is a notion but vaguely defined. Now if the state of 
a system is given by its energy and the external coordinates, it 
is incompletely defined, although its partial definition is perfectly 
clear as far as it goes. The ensemble of phases microcanonically 
distributed, with the given values of the energy and the external 
coordinates, will represent the imperfectly defined system better 
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than any other ensemble or single phase. When we approach the 
subject from this side, our theorems will naturally relate to average 
values, or most probable values, in such ensembles. 

In this case, the choice between the variables of (485) or of (489) 
will be determined partly by the relative importance which is at¬ 
tached to average and probable values. It would seem that in gen¬ 
eral average values are the most important, and that they lend 
themselves better to analytical transformations. This consideration 
would give the preference to the system of variables in which log V 
is the analogue of entropy. Moreover, if we make 0 the analogue of 
entropy, we are embarrassed by the necessity of making numerous 
exceptions for systems of one or two degrees of freedom. 

On the other hand, the definition of 0 may be regarded as a lit¬ 
tle more simple than that of log V, and if our choice is determined 
by the simplicity of the definitions of the analogues of entropy and 
temperature, it would seem that the 0 system should have the pref¬ 
erence. In our definition of these quantities, V was defined first, 
and derived from V by differentiation. This gives the relation 
of the quantities in the most simple analytical form. Yet so far as 
the notions are concerned, it is perhaps more natural to regard v 
as derived from e^ by integration. At all events, may be defined 
independently of V, and its definition may be regarded as more 
simple as not requiring the determination of the zero from which 
V is measured, which sometimes involves questions of a delicate na¬ 
ture. In fact, the quantity may exist, when the definition of V 
becomes illusory for practical purposes, as the integral by which it 
is determined becomes infinite. 

The case is entirely different, when we regard the temperature 
as an independent variable, and we have to consider a system which 
is described as having a certain temperature and certain values for 


STATISTICAL MECHANICS 


212 


the external coordinates. Here also the state of the system is not 
completely defined, and will be better represented by an ensemble 
of phases than by any single phase. What is the nature of such an 
ensemble as will best represent the imperfectly defined state? 

When we wish to give a body a certain temperature, we place it 
in a bath of the proper temperature, and when we regard what we 
call thermal equilibrium as established, we say that the body has 
the same temperature as the bath. Perhaps we place a second body 
of standard character, which we call a thermometer, in the bath, 
and say that the first body, the bath, and the thermometer, have 
all the same temperature. 

But the body under such circumstances, as well as the bath, and 
the thermometer, even if they were entirely isolated from external 
influences (which it is convenient to suppose in a theoretical discus¬ 
sion), would be continually changing in phase, and in energy as well 
as in other respects, although our means of observation are not fine 
enough to perceive these variations. 

The series of phases through which the whole system runs in the 
course of time may not be entirely determined by the energy, but 
may depend on the initial phase in other respects. In such cases the 
ensemble obtained by the microcanonical distribution of the whole 
system, which includes all possible time-ensembles combined in the 
proportion which seems least arbitrary, will represent better than 
any one time-ensemble the effect of the bath. Indeed a single time- 
ensemble, when it is not also a microcanonical ensemble, is too 
ill-defined a notion to serve the purposes of a general discussion. 
We will therefore direct our attention, when we suppose the body 
placed in a bath, to the microcanonical ensemble of phases thus 
obtained. 

If we now suppose the quantity of the substance forming the 
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bath to be increased, the anomalies of the separate energies of the 
body and of the thermometer in the microcanonical ensemble will 
be increased, but not without limit. The anomalies of the energy of 
the bath, considered in comparison with its whole energy, diminish 
indefinitely as the quantity of the bath is increased, and become 
in a sense negligible, when the quantity of the bath is sufficiently 
increased. The ensemble of phases of the body, and of the ther¬ 
mometer, approach a standard form as the quantity of the bath is 
indefinitely increased. This limiting form is easily shown to be what 
we have described as the canonical distribution. 

Let us write e for the energy of the whole system consisting of 
the body first mentioned, the bath, and the thermometer (if any), 
and let us first suppose this system to be distributed canonically 
with the modulus 0. We have by (205) 


and since 



0 2 


de 

dS' 



de n de 


dQ 2 de p 

If we write Ae for the anomaly of mean square, we have 


(Ae) 2 = (e - e)L 


If we set 


A0 = ^ de, 
de 


A0 will represent approximately the increase of 0 which would 
produce an increase in the average value of the energy equal to its 
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anomaly of mean square. Now these equations give 

2© 2 dl p 
n dl ’ 

which shows that we may diminish A© indefinitely by increasing 
the quantity of the bath. 

Now our canonical ensemble consists of an infinity of micro- 
canonical ensembles, which differ only in consequence of the differ¬ 
ent values of the energy which is constant in each. If we consider 
separately the phases of the first body which occur in the canonical 
ensemble of the whole system, these phases will form a canonical 
ensemble of the same modulus. This canonical ensemble of phases 
of the first body will consist of parts which belong to the different 
microcanonical ensembles into which the canonical ensemble of the 
whole system is divided. 

Let us now imagine that the modulus of the principal canonical 
ensemble is increased by 2 A©, and its average energy by 2Ae. 
The modulus of the canonical ensemble of the phases of the first 
body considered separately will be increased by 2 A©. We may 
regard the infinity of microcanonical ensembles into which we have 
divided the principal canonical ensemble as each having its energy 
increased by 2 Ae. Let us see how the ensembles of phases of the 
first body contained in these microcanonical ensembles are affected. 
We may assume that they will all be affected in about the same 
way, as all the differences which come into account may be treated 
as small. Therefore, the canonical ensemble formed by taking them 
together will also be affected in the same way. But we know how 
this is affected. It is by the increase of its modulus by 2 A©, a 
quantity which vanishes when the quantity of the bath is indefinitely 
increased. 
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In the case of an infinite bath, therefore, the increase of the en¬ 
ergy of one of the microcanonical ensembles by 2 Ae, produces a 
vanishing effect on the distribution in energy of the phases of the 
first body which it contains. But 2 Ae is more than the average 
difference of energy between the microcanonical ensembles. The 
distribution in energy of these phases is therefore the same in the 
different microcanonical ensembles, and must therefore be canoni¬ 
cal, like that of the ensemble which they form when taken together.* 
As a general theorem, the conclusion may be expressed in the 
words:—If a system of a great number of degrees of freedom is 
microcanonically distributed in phase, any very small part of it may 
be regarded as canonically distributed.^ 

It would seem, therefore, that a canonical ensemble of phases is 
what best represents, with the precision necessary for exact mathe¬ 
matical reasoning, the notion of a body with a given temperature, if 
we conceive of the temperature as the state produced by such pro¬ 
cesses as we actually use in physics to produce a given temperature. 
Since the anomalies of the body increase with the quantity of the 
bath, we can only get rid of all that is arbitrary in the ensemble of 
phases which is to represent the notion of a body of a given ternper- 

*In order to appreciate the above reasoning, it should be understood that 
the differences of energy which occur in the canonical ensemble of phases of the 
first body are not here regarded as vanishing quantities. To fix one’s ideas, one 
may imagine that he has the fineness of perception to make these differences 
seem large. The difference between the part of these phases which belong to 
one microcanonical ensemble of the whole system and the part which belongs to 
another would still be imperceptible, when the quantity of the bath is sufficiently 
increased. 

tit is assumed—and without this assumption the theorem would have no 
distinct meaning—that the part of the ensemble considered may be regarded 
as having separate energy. 
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ature by making the bath infinite, which brings ns to the canonical 
distribution. 

A comparison of temperature and entropy with their analogues 
in statistical mechanics would be incomplete without a considera¬ 
tion of their differences with respect to units and zeros, and the 
numbers used for their numerical specification. If we apply the no¬ 
tions of statistical mechanics to such bodies as we usually consider 
in thermodynamics, for which the kinetic energy is of the same or¬ 
der of magnitude as the unit of energy, but the number of degrees of 
freedom is enormous, the values of 0, de/d log V, and de/def) will be 
of the same order of magnitude as 1 jn, and the variable part of fj, 
log V, and ef> will be of the same order of magnitude as n* If these 
quantities, therefore, represent in any sense the notions of tempera¬ 
ture and entropy, they will nevertheless not be measured in units of 
the usual order of magnitude,—a fact which must be borne in mind 
in determining what magnitudes may be regarded as insensible to 
human observation. 

Now nothing prevents our supposing energy and time in our sta¬ 
tistical formulae to be measured in such units as may be convenient 
for physical purposes. But when these units have been chosen, the 
numerical values of 0, de/dlogV, de/def), fj, logl^, ef>, are entirely 
determined,' and in order to compare them with temperature and 
entropy, the numerical values of which depend upon an arbitrary 
unit, we must multiply all values of 0, de/dlogV, de/def by a con¬ 
stant (K), and divide all values of fj, log!/, and ef by the same 

*See equations (124), (288), (289), and (314); also page 125. 

iThe unit of time only affects the last three quantities, and these only by 
an additive constant, which disappears (with the additive constant of entropy), 
when differences of entropy are compared with their statistical analogues. See 
page 20. 
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constant. This constant is the same for all bodies, and depends 
only on the units of temperature and energy which we employ. For 
ordinary units it is of the same order of magnitude as the numbers 
of atoms in ordinary bodies. 

We are not able to determine the numerical value of K , as it 
depends on the number of molecules in the bodies with which we 
experiment. To fix our ideas, however, we may seek an expression 
for this value, based upon very probable assumptions, which will 
show how we would naturally proceed to its evaluation, if our powers 
of observation were fine enough to take cognizance of individual 
molecules. 

If the unit of mass of a monatomic gas contains u atoms, and it 
may be treated as a system of 3 v degrees of freedom, which seems 
to be the case, we have for canonical distribution 

e p = |z/@, 

de p 2 
dO = * V ' 

If we write T for temperature, and c v for the specific heat of the gas 
for constant volume (or rather the limit toward which this specific 
heat tends, as rarefaction is indefinitely increased), we have 

de p 

dT = ° v: 

since we may regard the energy as entirely kinetic. We may set the 
e p of this equation equal to the e p of the preceding, where indeed 
the individual values of which the average is taken would appear to 
human observation as identical. This gives 

d@ 2c„ 

dT = 3z7’ 


(491) 
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whence 

1 2 c v 
T{ = ~3u 

a value recognized by physicists as a constant independent of the 
kind of monatomic gas considered. 

We may also express the value of K in a somewhat different 
form, which corresponds to the indirect method by which physicists 
are accustomed to determine the quantity c v . The kinetic energy 
due to the motions of the centers of mass of the molecules of a mass 
of gas sufficiently expanded is easily shown to be equal to 

§ pv, 

where p and v denote the pressure and volume. The average value 
of the same energy in a canonical ensemble of such a mass of gas is 

|0z/, 

where v denotes the number of molecules in the gas. Equating these 
values, we have 

pv = @z/, (494) 

whence 

1 0 pv 

K = T = z7f' 

Now the laws of Boyle, Charles, and Avogadro may be expressed 
by the equation 

pv = AuT, (496) 

where A is a constant depending only on the units in which energy 
and temperature are measured. 1/K, therefore, might be called the 
constant of the law of Boyle, Charles, and Avogadro as expressed 
with reference to the true number of molecules in a gaseous body. 
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Since such numbers are unknown to us, it is more convenient to 
express the law with reference to relative values. If we denote by M 
the so-called molecular weight of a gas, that is, a number taken from 
a table of numbers proportional to the weights of various molecules 
and atoms, but having one of the values, perhaps the atomic weight 
of hydrogen, arbitrarily made unity, the law of Boyle, Charles, and 
Avogadro may be written in the more practical form 

(497) 

where A 1 is a constant and m the weight of gas considered. It is 
evident that 1 /K is equal to the product of the constant of the law 
in this form and the (true) weight of an atom of hydrogen, or such 
other atom or molecule as may be given the value unity in the table 
of molecular weights. 

In the following chapter we shall consider the necessary modi¬ 
fications in the theory of equilibrium, when the quantity of matter 
contained in a system is to be regarded as variable, or, if the system 
contains more than one kind of matter, when the quantities of the 
several kinds of matter in the system are to be regarded as indepen¬ 
dently variable. This will give us yet another set of variables in the 
statistical equation, corresponding to those of the amplified form of 
the thermodynamic equation. 


pi ’ = A ' T ¥r 



XV. 


SYSTEMS COMPOSED OF MOLECULES. 

The nature of material bodies is such, that especial interest attaches 
to the dynamics of systems composed of a great number of entirely 
similar particles, or, it may be, of a great number of particles of 
several kinds, all of each kind being entirely similar to each other. 
We shall therefore proceed to consider systems composed of such 
particles, whether in great numbers or otherwise, and especially to 
consider the statistical equilibrium of ensembles of such systems. 
One of the variations to be considered in regard to such systems is 
a variation in the numbers of the particles of the various kinds which 
it contains, and the question of statistical equilibrium between two 
ensembles of such systems relates in part to the tendencies of the 
various kinds of particles to pass from the one to the other. 

First of all, we must define precisely what is meant by statis¬ 
tical equilibrium of such an ensemble of systems. The essence of 
statistical equilibrium is the permanence of the number of systems 
which fall within any given limits with respect to phase. We have 
therefore to define how the term “phase” is to be understood in 
such cases. If two phases differ only in that certain entirely simi¬ 
lar particles have changed places with one another, are they to be 
regarded as identical or different phases? If the particles are re¬ 
garded as indistinguishable, it seems in accordance with the spirit 
of the statistical method to regard the phases as identical. In fact, 
it might be urged that in such an ensemble of systems as we are 
considering no identity is possible between the particles of different 
systems except that of qualities, and if v particles of one system 
are described as entirely similar to one another and to v of another 
system, nothing remains on which to base the identification of any 
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particular particle of the first system with any particular particle 
of the second. And this would be true, if the ensemble of systems 
had a simultaneous objective existence. But it hardly applies to 
the creations of the imagination. In the cases which we have been 
considering, and in those which we shall consider, it is not only 
possible to conceive of the motion of an ensemble of similar systems 
simply as possible cases of the motion of a single system, but it is 
actually in large measure for the sake of representing more clearly 
the possible cases of the motion of a single system that we use the 
conception of an ensemble of systems. The perfect similarity of 
several particles of a system will not in the least interfere with the 
identification of a particular particle in one case with a particular 
particle in another. The question is one to be decided in accordance 
with the requirements of practical convenience in the discussion of 
the problems with which we are engaged. 

Our present purpose will often require us to use the terms phase , 
density-in-phase , statistical equilibrium , and other connected terms 
on the supposition that phases are not altered by the exchange of 
places between similar particles. Some of the most important ques¬ 
tions with which we are concerned have reference to phases thus 
defined. We shall call them phases determined by generic defini¬ 
tions, or briefly, generic phases. But we shall also be obliged to 
discuss phases defined by the narrower definition (so that exchange 
of position between similar particles is regarded as changing the 
phase), which will be called phases determined by specific defini¬ 
tions, or briefly, specific phases. For the analytical description of a 
specific phase is more simple than that of a generic phase. And it 
is a more simple matter to make a multiple integral extend over all 
possible specific phases than to make one extend without repetition 
over all possible generic phases. 
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It is evident that if zq, z/ 2 , ... zq, are the numbers of the dif¬ 
ferent kinds of molecules in any system, the number of specific 
phases embraced in one generic phase is represented by the con¬ 
tinued product zq! zq! •• • zq! and the coefficient of probability of a 
generic phase is the sum of the probability-coefficients of the specific 
phases which it represents. When these are equal among themselves, 
the probability-coefficient of the generic phase is equal to that of 
the specific phase multiplied by zq! zq! ■ ■ ■ zq!. It is also evident that 
statistical equilibrium may subsist with respect to generic phases 
without statistical equilibrium with respect to specific phases, but 
not vice versa. 

Similar questions arise where one particle is capable of several 
equivalent positions. Does the change from one of these positions 
to another change the phase? It would be most natural and logi¬ 
cal to make it affect the specific phase, but not the generic. The 
number of specific phases contained in a generic phase would then 
be zq!/^ 1 • • ■ zq!zq' 1 , where zq, ... zq denote the numbers of equiva¬ 
lent positions belonging to the several kinds of particles. The case 
in which a n is infinite would then require especial attention. It 
does not appear that the resulting complications in the formulae 
would be compensated by any real advantage. The reason of this 
is that in problems of real interest equivalent positions of a particle 
will always be equally probable. In this respect, equivalent posi¬ 
tions of the same particle are entirely unlike the v\ different ways in 
which v particles may be distributed in v different positions. Let it 
therefore be understood that in spite of the physical equivalence of 
different positions of the same particle they are to be considered as 
constituting a difference of generic phase as well as of specific. The 
number of specific phases contained in a generic phase is therefore 
always given by the product zq! zq! ■ ■ ■ zq!. 
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Instead of considering, as in the preceding chapters, ensembles of 
systems differing only in phase, we shall now suppose that the sys¬ 
tems constituting an ensemble are composed of particles of various 
kinds, and that they differ not only in phase but also in the num¬ 
bers of these particles which they contain. The external coordinates 
of all the systems in the ensemble are supposed, as heretofore, to 
have the same value, and when they vary, to vary together. For 
distinction, we may call such an ensemble a grand ensemble, and 
one in which the systems differ only in phase a petit ensemble. A 
grand ensemble is therefore composed of a multitude of petit en¬ 
sembles. The ensembles which we have hitherto discussed are petit 
ensembles. 

Let U \, . ■ - Vh-, etc., denote the numbers of the different kinds 
of particles in a system, e its energy, and q \, ... q n , p \, ...p n its 
coordinates and momenta. If the particles are of the nature of ma¬ 
terial points, the number of coordinates (n) of the system will be 

equal to 3zq -f-f- 3 u h . But if the particles are less simple in their 

nature, if they are to be treated as rigid solids, the orientation of 
which must be regarded, or if they consist each of several atoms, 
so as to have more than three degrees of freedom, the number of 
coordinates of the system will be equal to the sum of zq, z/ 2 , etc., 
multiplied each by the number of degrees of freedom of the kind of 
particle to which it relates. 

Let us consider an ensemble in which the number of systems 
having zq, ... zq particles of the several kinds, and having values 
of their coordinates and momenta lying between the limits and 
Qi + dqi, pi and pi + dp \, etc., is represented by the expression 

-h ii h u h -e 

Ne © 


zq! • • • v h \ 


dpi... dq n , 


(498) 
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where N, 0, Hi, • • • A l h are constants, N denoting the total number 
of systems in the ensemble. The expression 


- \-ii h Vh-£ 

Ne © 


z/i! • • • v h \ 


(499) 


evidently represents the density-in-phase of the ensemble within the 
limits described, that is, for a phase specifically defined. The ex¬ 
pression 

fj+/nvrj— 
e © 

- - - - - dp\ ... dq n , (500) 

I'd • • • zbJ 

is therefore the probability-coefficient for a phase specifically de¬ 
fined. This has evidently the same value for all the u\\ ■ ■ ■ iyj phases 
obtained by interchanging the phases of particles of the same kind. 
The probability-coefficient for a generic phase will be u\\ ■ ■ ■ ///,,! 
times as great, viz., 

- \-fi h u h -e 

e © . (501) 

We shall say that such an ensemble as has been described is 
canonically distributed, and shall call the constant 0 its modulus. 
It is evidently what we have called a grand ensemble. The petit 
ensembles of which it is composed are canonically distributed, ac¬ 
cording to the definitions of Chapter IV, since the expression 


Q+M mij— 
e © 


v\\ ■ ■ ■ v h \ 


(502) 


is constant for each petit ensemble. The grand ensemble, therefore, 
is in statistical equilibrium with respect to specific phases. 

If an ensemble, whether grand or petit, is identical so far as 
generic phases are concerned with one canonically distributed, we 
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shall say that its distribution is canonical with respect to generic 
phases. Such an ensemble is evidently in statistical equilibrium with 
respect to generic phases, although it may not be so with respect 
to specific phases. 

If we write H for the index of probability of a generic phase in 
a grand ensemble, we have for the case of canonical distribution 


H 


Q + Hii>i + ■ ■ ■ + ~~ e 

0 


(503) 


It will be observed that the H is a linear function of e and iq, ... v h \ 
also that whenever the index of probability of generic phases in a 
grand ensemble is a linear function of e, iq, ... Vh, the ensemble is 
canonically distributed with respect to generic phases. 

The constant 12 we may regard as determined by the equation 




l^iH- VVhVh—z 



dpi... dq n , 


(504) 


or 


_n 
e e 


£ 



fj- ivH-j -jWh 

e e 


vd. ■ • • u h \ 


j all 

phases 


e _ 0 dpi... dq n , (505) 


where the multiple sum indicated by J2 Ui ■ ■ ■ '}2 l , h includes all terms 
obtained by giving to each of the symbols v \, ...Vh all integral 
values from zero upward, and the multiple integral (which is to be 
evaluated separately for each term of the multiple sum) is to be 
extended over all the (specific) phases of the system having the 
specified numbers of particles of the various kinds. The multiple 

integral in the last equation is what we have represented by e ©. 
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See equation(92). We may therefore write 


_q 
e e 


mj— \-ji h v h -4' 

Y---Y- _-_ 

^ ^ U\\ - ■ ■ is h \ 


(506) 


It should be observed that the summation includes a term in 
which all the symbols u \, ... zy t have the value zero. We must there¬ 
fore recognize in a certain sense a system consisting of no particles, 
which, although a barren subject of study in itself, cannot well be 
excluded as a particular case of a system of a variable number of 
particles. In this case e is constant, and there are no integrations 
to be performed. We have therefore* 

b a 

e _ © = e~©, i.e., ip = e. 


The value of e p is of course zero in this case. But the value of e q 
contains an arbitrary constant, which is generally determined by 
considerations of convenience, so that e q and e do not necessarily 
vanish with U \, ... 

Unless — has a finite value, our formulae become illusory. We 
have already, in considering petit ensembles canonically distributed, 
found it necessary to exclude cases in which —ip has not a finite 
valued The same exclusion would here make —ip finite for any 
finite values of iq, ... v^. This does not necessarily make a multiple 
series of the form (506) finite. We may observe, however, that if for 
all values of v \, ... vy, 

-ip < Co + CiUi -| -b c h u h , (507) 

*This conclusion may appear a little strained. The original definition of if) 
may not be regarded as fairly applying to systems of no degrees of freedom. We 
may therefore prefer to regard these equations as defining if> in this case. 

^See Chapter IV, page 39. 
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where c 0 , ci, ... Ch are constants or functions of 0 , 


_n 

e © < 


E-E 


cp+(An+ci)^i-|- 

e © 


V\\---Vh\ 


i.e. 


_n co x e 
e © < e© 2 - 


Ml+ci 

© 

z/i! 


E 




j - ® - ^ 


z.e. 


e © < e © e 1 


a 


Mh+ C h. .. 

, e u h 


i.e. 


Vt Co /H±£i 
— < — + e © 1 

0-0 


^h~\~ c h 

+ -he © 


~Vh 


(508) 


The value of —hi will therefore be finite, when the condition (507) is 
satisfied. If therefore we assume that —hi is finite, we do not appear 
to exclude any cases which are analogous to those of nature.* 

The interest of the ensemble which has been described lies in 
the fact that it may be in statistical equilibrium, both in respect 
to exchange of energy and exchange of particles, with other grand 
ensembles canonically distributed and having the same values of 0 

*If the external coordinates determine a certain volume within which the 
system is confined, the contrary of (507) would imply that we could obtain an 
infinite amount of work by crowding an infinite quantity of matter into a finite 
volume. 
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and of the coefficients /xi, /x 2 , etc., when the circumstances are such 
that exchange of energy and of particles are possible, and when 
equilibrium would not subsist, were it not for equal values of these 
constants in the two ensembles. 

With respect to the exchange of energy, the case is exactly the 
same as that of the petit ensembles considered in Chapter IV, and 
needs no especial discussion. The question of exchange of particles 
is to a certain extent analogous, and may be treated in a somewhat 
similar manner. Let us suppose that we have two grand ensem¬ 
bles canonically distributed with respect to specific phases, with the 
same value of the modulus and of the coefficients /x i, ... fx h , and let 
us consider the ensemble of all the systems obtained by combining 
each system of the first ensemble with each of the second. 

The probability-coefficient of a generic phase in the first ensem¬ 
ble may be expressed by 

- 1 -Ii h f' h -e' 

e e . (509) 

The probability-coefficient of a specific phase will then be expressed 
by 

l^i~\ - 

e © 

z/(! • • • 

since each generic phase comprises u\\ ■ ■ ■ ///,! specific phases. In 
the second ensemble the probability-coefficients of the generic and 
specific phases will be 

H"+(U1 - \-fj, h v"-e" 

e © , (511) 
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The probability-coefficient of a generic phase in the third ensem¬ 
ble, which consists of systems obtained by regarding each system 
of the first ensemble combined with each of the second as form¬ 
ing a system, will be the product of the probability-coefficients of 
the generic phases of the systems combined, and will therefore be 
represented by the formula 

n , "+ l i 1 u" , +~.+n h v'"-e'" 

e © (513) 


where Q!" = ft' + ft", e'" = e' + e", //" = is[ + //", etc. It will 
be observed that z/"', etc., represent the numbers of particles of 
the various kinds in the third ensemble, and e'" its energy; also 
that ft'" is a constant. The third ensemble is therefore canonically 
distributed with respect to generic phases. 

If all the systems in the same generic phase in the third ensemble 
were equably distributed among the //"'! • • • v "'! specific phases which 
are comprised in the generic phase, the probability-coefficient of a 
specific phase would be 


+-+ l x h v' } ['-e" 

> e 


<! ■ ■ ■ <'■ 


(514) 


In fact, however, the probability-coefficient of any specific phase 
which occurs in the third ensemble is 


e © 


v ! 


lS h \Vi 


'h- 


(515) 


which we get by multiplying the probability-coefficients of specific 
phases in the first and second ensembles. The difference between the 
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formulae (514) and (515) is due to the fact that the generic phases 
to which (513) relates include not only the specific phases occurring 
in the third ensemble and having the probability-coefficient (515), 
but also all the specifier phases obtained from these by interchange 
of similar particles between two combined systems. Of these the 
probability-coefficient is evidently zero, as they do not occur in the 
ensemble. 

Now this third ensemble is in statistical equilibrium, with respect 
both to specific and generic phases, since the ensembles from which 
it is formed are so. This statistical equilibrium is not dependent 
on the equality of the modulus and the coefficients /i i, ... /i/ t in the 
first and second ensembles, ft depends only on the fact that the two 
original ensembles were separately in statistical equilibrium, and 
that there is no interaction between them, the combining of the two 
ensembles to form a third being purely nominal, and involving no 
physical connection. This independence of the systems, determined 
physically by forces which prevent particles from passing from one 
system to the other, or coming within range of each other’s action, 
is represented mathematically by infinite values of the energy for 
particles in a space dividing the systems. Such a space may be 
called a diaphragm. 

If we now suppose that, when we combine the systems of the 
two original ensembles, the forces are so modified that the energy 
is no longer infinite for particles in all the space forming the di¬ 
aphragm, but is diminished in a part of this space, so that it is 
possible for particles to pass from one system to the other, this will 
involve a change in the function e'" which represents the energy of 
the combined systems, and the equation e'" — e' + e" will no longer 
hold. Now if the coefficient of probability in the third ensemble were 
represented by (513) with this new function e'", we should have sta- 
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tistical equilibrium, with respect to generic phases, although not to 
specific. But this need involve only a trifling change in the distri¬ 
bution of the third ensemble,* a change represented by the addition 
of comparatively few systems in which the transference of particles 
is taking place to the immense number obtained by combining the 
two original ensembles. The difference between the ensemble which 
would be in statistical equilibrium, and that obtained by combining 
the two original ensembles may be diminished without limit, while 
it is still possible for particles to pass from one system to another. 
In this sense we may say that the ensemble formed by combining the 
two given ensembles may still be regarded as in a state of (approx¬ 
imate) statistical equilibrium with respect to generic phases, when 
it has been made possible for particles to pass between the sys¬ 
tems combined, and when statistical equilibrium for specific phases 
has therefore entirely ceased to exist, and when the equilibrium 
for generic phases would also have entirely ceased to exist, if the 
given ensembles had not been canonically distributed, with respect 
to generic phases, with the same values of 0 and fi i, .../i^. 

It is evident also that considerations of this kind will apply sepa¬ 
rately to the several kinds of particles. We may diminish the energy 
in the space forming the diaphragm for one kind of particle and not 
for another. This is the mathematical expression for a “semiper- 
meable” diaphragm. The condition necessary for statistical equilib¬ 
rium where the diaphragm is permeable only to particles to which 
the suffix ( )i relates will be fulfilled when /jl\ and 0 have the same 

*It will be observed that, so far as the distribution is concerned, very large 
and infinite values of e (for certain phases) amount to nearly the same thing,— 
one representing the total and the other the nearly total exclusion of the phases 
in question. An infinite change, therefore, in the value of e (for certain phases) 
may represent a vanishing change in the distribution. 
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values in the two ensembles, although the other coefficients /i 2 , etc., 
may be different. 

This important property of grand ensembles with canonical dis¬ 
tribution will supply the motive for a more particular examination 
of the nature of such ensembles, and especially of the comparative 
numbers of systems in the several petit ensembles which make up a 
grand ensemble, and of the average values in the grand ensemble of 
some of the most important quantities, and of the average squares 
of the deviations from these average values. 

The probability that a system taken at random from a grand 
ensemble canonically distributed will have exactly iq, ... zy,, particles 
of the various kinds is expressed by the multiple integral 

- \-!JL h v h -e 

I • a11 - / --;-;- dp 1 .. . dq n , (516) 

J Phases J up. ■ ■ ■ V h \ 


or 


Q+fi ii/H- 

e © 


ffi! • • • Vh ] - 


(517) 


This may be called the probability of the petit ensemble (v \,... Vh). 
The sum of all such probabilities is evidently unity. That is, 


n+tuyi-i — h^hi'h-i’ 

Y-Y- _ 9 _ 


i, 


(518) 


which agrees with (506). 

The average value in the grand ensemble of any quantity u, is 
given by the formula 


u = 


£•••£ 


Cl+jnv l-l- \-fj, h i/ h -e 

ail f ue © 


Vl 


Vh. 


phases 


L'l' ' ' ' Vh- 


dp\ ... dq n . (519) 
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If u is a function of iq, ... zy, alone, he., if it has the same value in 
all systems of any same petit ensemble, the formula reduces to 


u = 


Vh 


O+Mivi-I-h /i h v h -ip 

ue © 

- ■ ■ V h \ 


(520) 


Again, if we write u | grand and w| petit to distinguish averages in the 
grand and petit ensembles, we shall have 

O+Atl^iH- \-[i h Vh—4> 

^Igrand “ \ | ' (^21) 

v\ "h. ' 


In this chapter, in which we are treating of grand ensembles, 
u will always denote the average for a grand ensemble. In the 
preceding chapters, u has always denoted the average for a petit 
ensemble. 

Equation (505), which we repeat in a slightly different form, viz., 


_ n 
e e 



Ml^lH-1 

e © 


Vi\ ■ ■ ■ 


dpi • • • dq n , 


(522) 


shows that is a function of 0 and ... ph\ also of the external 
coordinates ai, 02 , etc., which are involved implicitly in e. If we 
differentiate the equation regarding all these quantities as variable, 
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we have 


_ Q 

e © 


f E-E 1 




Vh 


/ all 

phases 


AljVrj- h/J.ht'h-c 

(pi^i H-h - e) e e 


dp 

“e 


!• • • 

Wj-hAtfe^fe-e 


hpi. . . dq r 


£E-E 


Vh 


+ etc. 


da i > 


phases 


all 


zq e 


e 

r i 


zq! • 

1 a 

• • Vh' 


Hivi-\ - \-fJ, h v h -e 

de 

e 

e 


dpi . .. dq n 


etc. 




phases J da\ Z/ j! • • • ///, ! 


dpi . .. dq n 


(523) 


n 

If we multiply this equation by e©, and set as usual A 1; A 2 , etc., 
for —de/dai, — de/da 2l etc., we get in virtue of the law expressed by 
equation (519), 


<m n 
e~ + ©2 


dQ _ _ 

-Q2 ^ lUl H -A Wh. -e) 

dp i _ dp 2 _ 

-Q- + -Q- ^2 + etc. 

dai — j- da 2 —r 

-Q- + -Q- ^2 + etc-; 


( 524 ) 
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that is, 

dn = n + w 1 + -~ + v h v h -i de _yy ldtll _Y^A x da,. 


0 

Since equation (503) gives 

Q + filial + ' ' ' + fJ-h^h — £ yy 

-0- = H ' 

the preceding equation may be written 

dQ = H d@ — u 1 dfii — Ai da\. 

Again, equation (526) gives 

dQ + Hi dVi + T>i dfii — de = 0 dH + H dQ. 

Eliminating dQ from these equations, we get 

de = —0 dH + Hi dv i ~~ Ay da\. 

If we set 


(525) 

(526) 

(527) 

(528) 

(529) 


T = e + 0H, (530) 

dT = de + 0dH + Hd@, (531) 

we have _ 

dA> = H dQ + Hi dv i — Ay da\. (532) 

The corresponding thermodynamic equations are 

de = T dr] + /i \ dmi — A\ dai, (533) 

-0 = e — Trj /, (534) 

d0 = —rj dT + Hi dm\ — Ai da\. (535) 
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These are derived from the thermodynamic equations (114) and 
(117) by the addition of the terms necessary to take account of 
variation in the quantities (mi, m 2 , etc.) of the several substances 
of which a body is composed. The correspondence of the equations 
is most perfect when the component substances are measured in 
such units that m \, m 2 , etc., are proportional to the numbers of the 
different kinds of molecules or atoms. The quantities /xi, /z 2 , etc., 
in these thermodynamic equations may be defined as differential 
coefficients by either of the equations in which they occur.* 

If we compare the statistical equations (529) and (532) with 
(114) and (112), which are given in Chapter IV, and discussed in 
Chapter XIV, as analogues of thermodynamic equations, we find 
considerable difference. Beside the terms corresponding to the ad¬ 
ditional terms in the thermodynamic equations of this chapter, and 
beside the fact that the averages are taken in a grand ensemble in 
one case and in a petit in the other, the analogues of entropy, H 
and 77 , are quite different in definition and value. We shall return to 
this point after we have determined the order of magnitude of the 
usual anomalies of iq, ... u/, . 

If we differentiate equation (518) with respect to fi 1 , and multi¬ 
ply by 0 , we get 


jjjynvlj- 

e 



0, (536) 


whence dVt/d/ii = — iq, which agrees with (527). Differentiating 
again with respect to fi 1 and to /i 2 , and setting 


dO _ dil 


-J- = -J- = -V2, 

d/jj 1 dfi2 


Compare Transactions Connecticut Academy, Vol. Ill, pages 136 ff. 
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we get 


£•••£ 




Vh. 


fl+fiiui-i -j-M-ji 

d 2 Vt {v\ — Xh) 2 \ e © 


d/if 


0 


L'l' ■ ■ ■ 


= 0, (537) 


n+nivi-\ — \-fi h v h -j< 

y- y> / ^ ^ (^l — ^i)(^2 — ^2) A e 0 _ 

^ © / ^l! • • • ^ft! 

vi v h 

(538) 


The first members of these equations represent the average values 
of the quantities in the principal parentheses. We have therefore 


(yi - ^i ) 2 = vl -isi = 


0 (]H 1 - 0 
d/i'f d/ii 


(ui - vi)(h > 2 ~ V2) = V1V2 ~ V1V2 = -0 


d 2 n 

d/i\ d^ 


= 0 


dui 

dfi2 


= 0 


(539) 

du 2 
d/i\ 

(540) 


From equation (539) we may get an idea of the order of magni¬ 
tude of the divergences of v\ from its average value in the ensemble, 
when that average value is great. The equation may be written 

— Vi) 2 0 dVi 

v\ v\ d/ii ’ 



The second member of this equation will in general be small when 
V\ is great. Large values are not necessarily excluded, but they must 
be confined within very small limits with respect to ji. For if 


(z/i - Ui) 2 

tt 2 


1 

> —, 


( 542 ) 
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for all values of //] between the limits fi\ and //", we shall have 
between the same limits 


0 , 

— du 1 > dfj, u 


(543) 


and therefore 



AT- 


(544) 


The difference /i" — /J,\ is therefore numerically a very small quantity. 
To form an idea of the importance of such a difference, we should 
observe that in formula (498) Hi is multiplied by iq and the product 
subtracted from the energy. A very small difference in the value 
of /ii may therefore be important. But since vQ is always less 
than the kinetic energy of the system, our formula shows that /i" — 
n \, even when multiplied by v\ or 17 ", may still be regarded as an 
insensible quantity. 

We can now perceive the leading characteristics with respect to 
properties sensible to human faculties of such an ensemble as we are 
considering (a grand ensemble canonically distributed), when the 
average numbers of particles of the various kinds are of the same 
order of magnitude as the number of molecules in the bodies which 
are the subject of physical experiment. Although the ensemble con¬ 
tains systems having the widest possible variations in respect to the 
numbers of the particles which they contain, these variations are 
practically contained within such narrow limits as to be insensible, 
except for particular values of the constants of the ensemble. This 
exception corresponds precisely to the case of nature, when certain 
thermodynamic quantities corresponding to 0, n i, /i 2 , etc., which 
in general determine the separate densities of various components 
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of a body, have certain values which make these densities indeter¬ 
minate, in other words, when the conditions are such as determine 
coexistent phases of matter. Except in the case of these particular 
values, the grand ensemble would not differ to human faculties of 
perception from a petit ensemble, viz., any one of the petit ensem¬ 
bles which it contains in which l?i, X? 2 , etc., do not sensibly differ 
from their average values. 

Let us now compare the quantities H and rj, the average values 
of which (in a grand and a petit ensemble respectively) we have seen 
to correspond to entropy. Since 

hi + i + • • • + — e 


and 


V 


H - rj 


— e 

0 ’ 

n + niVi H-b A ih^h - ^ 


0 


(545) 


A part of this difference is due to the fact that H relates to generic 
phases and r/ to specific. If we write r/ geil for the index of probability 
for generic phases in a petit ensemble, we have 


H I/gen 


I/gen = rj + log I/i ! ■ • • U h \, 

H - T] = H - ? 7 gen + log v x \ ■ ■ ■ u h \, 

0 + -I-b /dhVh-'lp 

---log I/] 


(546) 

(547) 

(548) 


This is the logarithm of the probability of the petit ensemble 
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(zq,... u h ).* If we set 




gen 


0 

which corresponds to the equation 

ip — e 


Vge 


0 


= V, 


we have 
and 


-0gen = + 01ogz/i! • • • ZqJ, 


H hgen — 


n + piui h-k Wh - i> : 


gen 


0 


This will have a maximum when^ 


d'lpgen dip gen 

= pi, —T— = P 2 , etc. 


dzq 


dzq> 


(549) 


(550) 


(551) 

(552) 


Distinguishing values corresponding to this maximum by ac¬ 
cents, we have approximately, when zq, ... Uh are of the same order 
of magnitude as the numbers of molecules in ordinary bodies, 


D + Pi^i + ' ' ' + — V’gen 

H - I/gen = -0-— 

^ + V\ V '\ + ' ' ' + P/i^ft — V’gen 

“ 0 

f d^genV(A d 2, ip ge „ V Azq Au 2 _/ d 2 -0 gen V (A^) 2 

\ du 2 / 20 ydzqdz / 2 J 0 V / 20 

(553) 


*See formula (517). 

^Strictly speaking, ip gen is not determined as function of zq, ... zq, except 
for integral values of these variables. Yet we may suppose it to be determined 
as a continuous function by any suitable process of interpolation. 
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/ d?1pgen\ / d?\fjg en \ Ai/l Ai^o f 'l~ll>gi:n \ (A Vfa ) 


gH-t) g en _ gCg ^ y 20 yd.vtd.V2 

where 


© 


V d "l ) 20 


c = 


n + ii^ h -h {i h v' h -^' 


gen 


0 


and 


Aui — Ui — u[, Az /2 = ^2 ~ z4> etc. 


(554) 

(555) 

(556) 


This is the probability of the system (iy,... /y,). The probability 
that the values of /y, ... /y, lie within given limits is given by the 
multiple integral 




( d 2 0 8 cnV(A^) 2 
l dvf ) 20 


( d, ^g en \ At^2 
l du i I 0 





20 


d,v\ ... d,Vh ■ 


( 557 ) 


This shows that the distribution of the grand ensemble with 
respect to the values of ... /y, follows the “law of errors” when 
v \, ... v' h are very great. The value of this integral for the limits ±oo 
should be unity. This gives 


e 


c 


(2vr0)2 


1, 


(558) 


C =\\ogD - ~ log(27T0), 


or 


( 559 ) 
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where 


D = 


that is, 


d 2 ip. 


gen 


du 2 


d 2 ip. 


gen 


du 2 du i 

d Ipgen 

dvh dv\ 


D = 


d/ii 
du i 
d/i 2 

dvi 

dfih 
du i 


d 2 vp. 


gen 


diq di/ 2 


<*v, 


gen 


dz'f 

d V^gen 
du h dz ^2 


d/ri 

dz/ 2 

d/U 2 

dz/ 2 


dfi h 
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d 2 xp. 


gen 


dui du h 

d Ip gen 

du 2 du h 


d 2 ip. 


gen 


H 


d/j\ 
du h 
d/i 2 
du h 

dUh 
du h 


Now, by (553), we have for the first approximation 


(560) 


(561) 


H - d gen = C - \ log D - - log(27T0), 


(562) 


and if we divide by the constant K* to reduce these quantities to 
the usual unit of entropy, 


H ~ V gen _ log D - Mog(27r0) 
K ~ 2 K 


(563) 


This is evidently a negligible quantity, since K is of the same order 
of magnitude as the number of molecules in ordinary bodies. It is 


See pages 216-219. 
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to be observed that r/ gen is here the average in the grand ensem¬ 
ble, whereas the quantity which we wish to compare with H is the 
average in a petit ensemble. But as we have seen that in the case 
considered the grand ensemble would appear to human observation 
as a petit ensemble, this distinction may be neglected. 

The differences therefore, in the case considered, between the 
quantities which may be represented by the notations* 

Hgen I grand ’ % en I grand ’ % en lpetit 

are not sensible to human faculties. The difference 


% en I petit Vspec I petit — Ul ' ' ' ' U h •> 

and is therefore constant, so long as the numbers U \, .. .Vh are con¬ 
stant. For constant values of these numbers, therefore, it is immate¬ 
rial whether we use the average of r/ gen or of rj for entropy, since this 
only affects the arbitrary constant of integration which is added to 
entropy. But when the numbers zq, ... zq are varied, it is no longer 
possible to use the index for specific phases. For the principle that 
the entropy of any body has an arbitrary additive constant is sub¬ 
ject to limitation, when different quantities of the same substance 
are concerned. In this case, the constant being determined for one 
quantity of a substance, is thereby determined for all quantities of 
the same substance. 

To fix our ideas, let us suppose that we have two identical fluid 
masses in contiguous chambers. The entropy of the whole is equal 
to the sum of the entropies of the parts, and double that of one 

*In this paragraph, for greater distinctness, H gen | grand and ri spec | petit have 
been written for the quantities which elsewhere are denoted by H and rj. 
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part. Suppose a valve is now opened, making a communication be¬ 
tween the chambers. We do not regard this as making any change 
in the entropy, although the masses of gas or liquid diffuse into one 
another, and although the same process of diffusion would increase 
the entropy, if the masses of fluid were different. It is evident, there¬ 
fore, that it is equilibrium with respect to generic phases, and not 
with respect to specific, with which we have to do in the evaluation 
of entropy, and therefore, that we must use the average of H or 
of r/ gen , and not that of 77 , as the equivalent of entropy, except in 
the thermodynamics of bodies in which the number of molecules of 
the various kinds is constant. 
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